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Many of us do not realize the rate 
of growth in the segment within psy- 
chology devoted to problems related 
to industry. In 1949, Dennis pointed 
out that there were only 105 Fellows 
of the APA in Division 14, and that 
this was less than one per million of 
national population. By 1958 there 
were 250 Fellows and 409 members. 
When this is made a fraction against 
the denominator of national popula- 
tion statistics, it is still a very small 
one. However, it is a tremendous rate 
of growth. By 1958 Division 14 was, 
in terms of Fellows, the fourth larg- 
est division with 7% of the total num- 


1 This paper is part of a larger plan of 
papers, written at the request and suggestion 
of the Ford Foundation Program in Eco- 
nomic Development and Administration. 
Companion papers by Paul Lazarsfeld and 
Robert Dahl appear in the American Journal 
of Sociology and The American Political Sci- 
ence Review, respectively. The aim of the 
three, covering psychology, sociology, and 
political science, is to indicate research areas 
in the social sciences related to problems of 
business and industry. The method will be to 
illustrate research areas by describing con- 
ceptual development in a kind of ‘‘main cur- 
rents of thought”” manner. The tremendous 
volume of publication in the field makes any- 
thing like an exhaustive review impossible; it 
is only possible to pick special cases, and these 
where they illustrate turning points in thinking 
or point to research possibilities in the future. 
Further, it is sometimes necessary to make 
arbitrary limitations following traditional aca- 
demic book-keeping practice; for example, 
where industrial social psychology shades im- 
perceptibly into industrial sociology. 


ber of Fellows; in terms of rate of 
growth its increase of 238% since 
1948 is second only to the school psy- 
chologists’ rise of 339%. Moreover, 
Division 14 does not reflect much of 
the psychological work—in social 
psychology, communication, role the- 
ory, and the like—relevant to in- 
dustrial problems. If it were possible 
to take these into account, the 
growth and proportion would be 
even more striking. 

Dennis also pointed to the fact 
that in 1948 over half the Fellows of 
Division 14 were employed in col- 
leges and _ universities, suggesting 
that this indicated surprisingly little 
contact with industrial problems. 
Roughly the same kind of population 
figure holds today, but one might 
draw another implication from it. In 
spite of its adjectival title, suggestive 
of an ‘‘applied’’ character, industrial 
psychology is and always has been 
primarily an academic discipline. 
Rooted in psychological concepts in 
individual differences, motivation, 
social and experimental psychology, 
its inception and the course of its 
development have been primarily de- 
termined by the developing nature 
of psychological theory, rather than 
the pressures of exigencies within 
business and industry. The business 
of applying psychology to industry 
has been the application of psychol- 
ogy as it developed as psychology, 
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rather than a reshaping of the field 
by industrial pressures. A good text 
for a course in industrial psychology 
could well be a history of psychologi- 
cal problems as they appear in the in- 
dustrial setting rather than a history 
of industrial problems. To be sure, 
outside developments are evident in 
the history of industrial psychology. 
The development of the Civil Service 
gave impetus to studies of merit rat- 
ing and of selection. The creation of 
collective bargaining initiated a 
whole new set of psychological prob- 
lems (Haire, 1957a) and the com- 
plexities of war-time equipment 
spurred human engineering studies. 
More recently, the increase in size of 
organizations, the tight labor market 
which put emphasis on nonfinancial 
incentives, and the increasing ra- 
tionalization of jobs gave promi- 
nence to a set of social psychological 
problems. 

External pressures have had an in- 
fluence, but the development has 
been primarily within the subfield 
and within psychology. It is in these 
terms that this paper will try to draw 
the developing lines. The main de- 
terminants seem to be growths within 
the general body of psychological 
thinking in major areas such as mo- 
tivation, individual differences, and 
the like. Within this framework 
there are a separate set of narrower, 
more specific subfields of industrial 
psychology itself which have a de- 
velopmental history of their own. Fi- 
nally, cutting across these are the 
historical developments outside the 
field which exert a special influence 
from time to time. 


THE SPECIAL AREAS OF 
INDUSTRIAL PsYCHOLOGY 
It seems useful to distinguish three 
traditions within the field. They 
have quite different conceptual bases, 
their developmental histories are not 


at all the same, and they have rela- 
tively little contact with one another. 
The first is the field of personnel psy- 
chology, flowing from the tradition 
of individual differences; the second 
is human engineering, growing out of 
applied experimental psychology ; and 
the third is less compact, what might 
be called industrial social psychology, 
although it includes some individual 
problems of motivation and the like. 

The first two—personnel psychol- 
ogy and the human engineering ap- 
proach—stand sharply apart. The 
approach from the side of individual 
differences, applied to industry, seeks 
to identify the best man for the job. 
It uses essentially a_ correlational 
method and viewpoint, and the at- 
tention is focused on within-group 
variance in sharp distinction to the 
human engineer whose main interest 
is in the between-group variance, or 
the effects of treatments. The aim of 
the engineer is not to find the right 
man for the job, but as Chapanis 
(1952) says, ‘‘to make the job fit the 
man—any man.”’ Cronbach puts the 
distinction well (1957) when he says 
the experimentalist attacks nature, 
seeking to modify the environment, 
while the correlationist approaches 
nature like a lover, taking her as she 
is and studying her as such. There 
is more than a deep conceptual dif- 
ference between them; they are dia- 
metrically opposed in practice. Every 
step of success the engineer has in 
making the job fit Everyman de- 
stroys part of the reason for being of 
the selection tester, who depends on 
the fact that mot everyone can do the 
job equally well. In the reverse di- 
rection, it is also true, but less com- 
pelling, that if the effectiveness of 
selection had been greater it would 
have left less room for the work of 
the human engineer. sThis is not quite 
so true as the obverse, since the de- 
sign of equipment would always be 
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useful in special situations—where 
the task required skills which were in 
scarce supply, in a tight labor mar- 
ket, or in a military situation where 
the characteristics of the labor force 
may be inflexible, and the demands 
of the equipment beyond much im- 
provement by classification. In a 
way it is strange that the liberal tra- 
dition of remaking the world to make 
it fit Man better, in contrast with the 
relatively passive caste-ridden Dar- 
winian approach of the correlationist, 
should come from something called 
Engineering Psychology. The social 
philosophy seems to be grounded less 
in ideological conviction, however, 
than in a response to the complexities 
and demands of equipment. 

The philosophy of the social the- 
orist is a little harder to put concisely 
than the other two, but he stands 
quite far apart from them. The his- 
torical background is more complex. 
It includes some sociological tradi- 
tions and a group of diverse psycho- 
logical fields. The classic Manage- 
ment and the Worker (Roethlisberger 
& Dickson, 1939) marked the clear 
emphasis on certain aspects of group 
structure and_ social motivation. 
Moreno added others, and Lewin and 
the group’ dynamicists still others. 
Some of the philosophy of the in- 
stinctivists has remained in the in- 
terest in motivation and attitude 
measurement. and the communica- 
tion theorists, in dealing with both 
structure and draw on a 
variety of fields. There is a phenom- 
enological trend apparent as the field 
of social perception is brought to 
bear on industrial problems. In some 
ways the social approach seems closer 
to that of the engineer than to the 
correiationist. Both the social psy- 
chologist and the human engineer are 
interested in how the job may be re- 
built to maximize the utilization of 
human potential. The social theo- 
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rist, however, turns from the rela- 
tively simple sensory and motor 
problems of the human engineer to 
manipulate quite different variables. 
He will try to arrange work situa- 
tions to provide maximum motiva- 
tional satisfactions and to structure 
groups so that their strengths are not 
barriers but aids to the accomplish- 
ment of the organization’s produc- 
tive objectives, and the like. 

These, briefly, are the three sub- 
areas we will consider. Let us look at 
each of them in the light of their con- 
ceptual history and the research 
problems they have raised. 


HUMAN ENGINEERING 


Historically, the human engineer- 
ing approach is probably best traced 
to the early “conditions of work” 
interest. Here, notably, the very 
general environment was the center 
of attention in studies of fatigue, 
lighting, effects of music, and the 
like. The plant tended to be the unit 
of variation and the work group the 
unit of response. In their original 
the Hawthorne _ studies 
(Roethlisberger & Dickson, 1939) 
followed this pattern as late as the 
1930's. Later, attention narrowed to 
a more nearly individual basis. Al- 
though it does not quite fit chrono- 
logically, the spirit of the time studies 
of Taylor and the motion studies of 
Gilbreth helped to narrow the inter- 
est to smaller processes. 

The present interest—for example, 
in equipment design—uses the indi- 
vidual as the unit of response anal- 
and some intimate contact- 
apparatus as the unit of controlled 
variation. The work begins with 
simple questions like, “‘how long 
should the handle of the crank be?” 
and “how big should the type be?” 
and goes on to much subtler prob- 
lems of psychomotor control and the 
display of information. Methodo- 
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logically, a real advance was made by 
the emphasis on the identification of 
error variance associated respectively 
with man and machine in complex 
systems. 

The basic approach of the engineer- 
ing psychologist has been to ask, 
“what abilities does man have?”’; 
“what does the end product re- 
quire?”’; and ‘how can the machine 
be built to go between the two?”’ It 
has some interesting pressures from 
implications for the social system 
outside of psychology. It fits in well 
with the general tendency to ra- 
tionalize jobs in the interest of min- 
imizing the cost of turnover. If the 
job could be built ‘to fit the man— 
any man,” it also made it easier for 
the industrial organization to absorb 
variations attendent on business cy- 
cles by appropriate variation in rela- 
tively homogeneous personnel units 
requiring a minimum of recruitment 
and training. As the development 
popularly called automation contin- 


ues—the control of machines by ma- 
chines—the picture changes. As 
Drucker (1955) points out, with the 
high cost of automation the area of 


business risk shifts. It becomes im- 
portant to stabilize production—pref- 
erably at maximum output, but im- 
portantly at a stable figure. The area 
of psychological research interest 
tends to shift from the producer to 
the consumer. Ideally, a product de- 
signed to elicit high motivation to 
purchase, with rapid obsolescence 
and a fast waning of interest in the 
item coupled with an equally fast rise 
in the desire for another one like it is 
the psychological insight demanded 
to make automation work economi- 
cally. 

The rationalization of jobs to fit 
Everyman has had implications in 
both the other subareas of industrial 
psychology. On the one hand, to the 
extent to which individual differences 
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are minimized, the individual's feel- 
ing of his peculiar contribution is re- 
duced, and this diminution in ego- 
istic need-satisfactions has formed a 
fair part of the interest of the indus- 
trial social psychologist. On the 
other hand, the rationalization of 
jobs for Everyman is a tremendous 
waste of existing individual differ- 
ences, though the correlationists have 
not made an issue of it. Deskilling 
benefits companies taken singly, but, 
to maximize national productivity, 
it would seem wiser to utilize the 
peaks of individual abilities rather 
than the mid-height of the mean. By 
and large, also, though it is not a 
necessary implication, the changes 
resulting from human _ engineering 
have nét maximally utilized the po- 
tential effects of training. 

The human engineer has also be- 
gun to challenge the basis of the tra- 
ditional industrial engineer. In a 
sense, the engineer first invaded the 
field of the psychologist by recom- 
mending certain behavioral proce- 
dures on the basis of the findings of 
time and motion studies. This ap- 
proach, however, traditionally treats 
the machine as a constant and man 
as a variable, exploring the optimal 
movements most suitable to the con- 
tinued operation of a particular ma- 
chine. Except for studies of plant 
layout and work-bench layout, the 
engineer has typically sought to 
change the worker. Engineering psy- 
chology, since it is a planning of the 
machine on the basis of the operator’s 
characteristics, was bound to chal- 
lenge this approach. A beginning of 
such a challenge can be seen, for ex- 
ample, in J. S. Brown and W. O. 
Jenkins’ paper (1947) in which they 
propose a reclassification of motor re- 
sponses into static reactions, posi- 
tioning reactions, and movement re- 
actions. These are not divided in 
terms of observed performance on a 
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job, like, for example, the ‘“‘carry 
empty” ‘‘carry loaded” categories of 
the Therblig notation, but rather are 
based on hypothesized characteristics 
of the operator, and, as such, would 
provide an entirely different ap- 
proach to the problem of time and 
motion study. Likewise, Birming- 
ham and Taylor (1954) also suggest 
ways to analyze performances which 
led to entirely different kinds of cate- 
gories from the Therblig. Although 
it has not developed fully, the view- 
point has the seed of a broadly dif- 
ferent approach to work methods. 
It would search for categories in the 
characteristics of behavior, and tak- 
ing these as givens, flesh out the gen- 
eral principles on which machines 
could be designed to utilize them. 
Specific empirical research areas in 
the field have, in general, fallen under 
three heads: studies of the optimal 
environment, studies of information 
display, and studies of equipment de- 
sign (Fitts, 1951). It is in the nature 


of the field that the published reports 
are highly specific and tend to have a 
fragmentary character whose integra- 
tion is only achieved through an over- 
view of the concepts of the whole 


field. The volume of published re- 
ports on individual determinations is 
tremendous, even outside of the con- 
ventional professional journals. The 
Navy (U. S. Navy Special Devices 
Center), for example, in 1955 pub- 
lished a bibliography of 376 refer- 
ences to unclassified project reports 
in these areas. 

One problem within the area has 
shown tremendous growth and seems 
to have the broadest possible impli- 
cations for future research. Chapanis, 
Garner, and Morgan (1949) gave a 
clear and extended treatment to 
the problem of kinds and sources of 
error in man-machine systems. The 
identification of constant and vari- 
able errors and of errors associated 
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with the man and the equipment 
early gave the human engineer a 
kind of diagnostic tool to identify the 
places where work would be most 
profitably applied. Subsequently, 
this kind of approach led to an anal- 
ysis of systems errors in which all 
the specific problems of the field con- 
verged: the design of the controls 
may maximize or minimize the varia- 
bility of the operator’s response, the 
display of information similarly in- 
fluences the error, and the serial 
order of responses within an operator 
or from operator to operator becomes 
an important variable. As this tend- 
ency to analyze complex man-ma- 
chine systems in terms of minimizing 
systems errors grows, the engineer 

pproaches a field which seems, at 
irst glance, to belong more properly 
in the bailiwick of the social theorist: 
the problem of organization theory. 
Within relatively small units—radar 
warning systems, fire-control sys- 
tems, and the like—this approach 
has designated the optimal organiza- 
tion for minimizing error in the sys- 
tem. As it would apply to a larger 
organization—for instance, a_busi- 
ness or an industry group—the work 
would be cumbersome and volumi- 
nous, and it would require some new 
techniques for the descriptions of ac- 
tivities in various parts, but eventu- 
ally an organization theory for in- 
dustry should flow from the ap- 
proach of the human engineer. With 
its history of concentration on the 
display of information and of char- 
acteristic responses to certain kinds 
of messages, it probably fits most har- 
moniously with the decision theorists’ 
work on theories of teams (Helmer, 
1957; Marschak, 1952; Radner, 
1953). The introduction of problems 
of uncertainty in such systems (Car- 
ter, Meredith, & Shakle, 1954) 
changes the problem considerably 
from the usual formulation of the hu- 
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man engineer, but, still somewhat 
paradoxically, one basis for a future 
theory of social organizations seems 
to lie in the field. It is perhaps worth 
pointing out that the approach from 
the analysis of systems error in com- 
plex man-machine systems gives a 
particular kind of organization—i.e., 
one that minimizes error—and that 
it will differ sharply from the social 
theorist who may, for example, be 
interested also in maximizing per- 
sonal and organizational goals simul- 
taneously. 
PERSONNEL PSYCHOLOGY 

In its simple base, the field of per- 
sonnel psychology rests on a correla- 
tional relationship between a_nor- 
mally distributed predictor variable 
(which may or may not be simply re- 
lated to the skills and abilities re- 
quired on the job) on the one hand, 
and another normally distributed 
measure of criterion performance. 
The equally simple matching task is 
to eliminate, on the basis of the rela- 
tively inexpensive predictor variable, 
those with low likelihood of success 
in criterion performance. 

As the field developed, all three— 
the predictor, the criterion, and the 
match—lost their simplicity. The 
predictor went from simple motor 
skills and primary mental abilities to 
job knowledge, and finally to in- 
terests, motives, background, and 
even trainability. The criterion 
broadened into the complexities of 
job families, job description, job eval- 
uations, merit ratings, and multi- 
dimensionality, and has become the 
knottiest problem in the field. 
Matching went from the simple 
mathematics of screening to the 
much more involved problems of 
classification and team construction. 

In some ways the outstanding fact 
in the history of the simple selection 
procedures is the relatively long pla- 
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teau of no real progress after a period 
of initial improvement. Cronbach 
(1957) puts it that none of the refine- 
ments since 1920 have improved 
practical predictors by a noticeable 
amount. This is a rather extreme 
view, but it certainly seems likely 
that if one could plot all the validity 
coefficients ever reported, they would 
probably form a characteristic learn- 
ing curve, with a rapid initial rise and 
a long plateau as validities settle be- 
low an unattainable asymptote of .40 
or .45. 

Hull (1928) suggested that a prac- 
tical limit for validities might be in 
the neighborhood of .50. Nothing in 
the history of selection testing has 
radically revised this figure after 30 
years. The main source of limitation, 
however, seems to be with the proper 
and adequate definition of the crite- 
rion variable ther than with either 
the predictor or the nature of the 
match. I[t is almost certain that here, 
as in other areas referred to in this 
paper, further differentiation of the 
criterion is the eventual path to re- 
search progress. Wallace and Weitz 
pointed out in 1955 that the criterion 
problem leads all other topics in in- 
dustrial psychology in lip service, but 
trails in work reported. The problem 
is especially difficult in this area. 

The two problems of relevancy and 
reliability of criterion measures are 
relatively straight forward. The 
problem of the identification of mul- 
tiple criteria and their subsequent 
combination and weighting is prob- 
ably the most difficult but the most 
fruitful. The so-called “‘dollar’’ cri- 
terion, or the simple criterion of pro- 
ductivity seems inevitably to hide 
the details of the psychological prob- 
lem involved. On the one hand, such 
criteria leave out other important rel- 
evant measures—e.g., turnover, 
grievances, etc. On the other hand, 


and perhaps more important, such 
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criteria are themselves so deter- 
mined by a multitude of factors (var- 
ious skills and abilities, motivations, 
working conditions, and the like) 
that their prediction will probably 
never go far beyond the present 
level. Only the detailed differentia- 
tion of the criterion and its eventual 
reconstitution seem to hold promise 
for raising the general level of valid- 
ities. 

The problem of classification—op- 
timizing the simultaneous match of 
several men and several positions— 
has been a difficult one. Thorndike 
(1949) began the mathematics for a 
simple case, and, more recently, iter- 
ative solutions of the linear program- 
ming type have shown promise. 
Here, also, the maximization of the 
matrix values, while a great advance, 
can never be better than the criterion 
used to scale the values concerned. 

Even more discouraging in the 


mathematics of simple selection are 
the implications of a paper by Brown 


and Ghiselli (1953) which is too 
little noticed in the field. Working 
within the simple algebraic relation- 
ships of the correlation between pre- 
dictor and criterion, they provide a 
set of values in which one may read 
the percentage of improvement in 
productivity of a selected over an un- 
selected population. The improve- 
ment depends on (a) the validity, 
(b) the cut-off, and (c) the variance 
in criterial performance. Putting in 
hypothetical but reasonable figures, 
the result is discouraging. Given a 
frequently encountered validity of, 
say, .30, and the ability to reject 
50% of the applicants (having al- 
ready rejected on all other bases but 
the test), and with a ratio of best to 
worst in the criterion of 1.5 to 1, the 
improvement in productivity of the 
selected group over the unselected 
will be only about 3%. Even 3% is 
not to be disregarded as a practical 


175 


matter, but it is not the universal 
panacea that many business men 
have dreamed of—a dream that 
many psychologists, who should have 
known better, have encouraged. 

Several specific implications seem 
to flow from the Brown and Ghiselli 
paper. For one thing, in practical 
terms, the tremendous leverage 
gained by shifting the cut-off is obvi- 
ously related to the state of the labor 
market. In the past few years we 
have had a very tight labor market 
where it was necessary to hire almost 
everyone to fill jobs. Now, with the 
war-time bulge in the birth-rate com- 
ing out of schools, considerably more 
selectivity is possible, and the practi- 
cal value of even relatively low 
validities increases. We should note, 
also, that these values refer to a 
simple yes-no selection, and do not 
take into account the more compli- 
cated classification problem. 

Again, the tendency to rationalize 
jobs mentioned above cuts across the 
selector’s effectiveness. No precise 
data are available on the ratio of best 
to worst in industrial practice, but in 
highly mechanized production-line 
situations the ratio is said to be about 
1.1 to 1. Such a value gives little 
power to selection. As this trend con- 
tinues to reduce the cost of turnover 
and training, the possible improve- 
ments by selection disappear. It is 
just here that the human engineer’s 
attempt to rebuild the job so all can 
do it equally well can come in direct 
conflict with the tester. 

One area of testing probably bene- 
fits from all the variables in the 
Brown and Ghiselli function—the so- 
called assessment techniques for the 
early identification of high-level tal- 
ent. Although the validities are typi- 
cally much lower, the variance of 
performance in, for example, execu- 
tives, is very much greater than in 
lathe operators, and the percentage 
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selected is very much smaller as the 
triangle of the hierarchy narrows 
toward the top. These last two fac- 
tors combine to outweigh the present 
validities and give much more power 
than in the case of selection for cleri- 
cal and manual jobs. In the present 
practical situation there is one special 
danger here. Many of the tests used 
for personality assessment in the in- 
terests of selecting executives are 
general personality tests which are 
not validated for executive perform- 
ance but for the identification of per- 
sonality traits. The relevance of 
these characteristics to successful 
performance is often made by a kind 
of intuitive judgment of what kind of 
person one would like to have on the 
job. Since it is impossible to forecast 
the demands that will be placed on 
the organization in the future, this 
seems to be a dangerous putting of 
all of one’s personnel eggs in one 
basket. Fortunately, at the present, 
the validities are low enough so that 
the consistent use of almost any of 
the personality tests will still provide 
enough error variance to protect the 
organization against poor judgment 
about the qualities it thinks it is se- 
lecting. 

The increase in productivity af- 
forded by selection also leads one to 
think of the possible improvement of- 
fered by training. Unfortunately, the 
literature on the measured effect of 
training programs in industry is 
sparse. This is particularly surpris- 
ing, perhaps, in a science where such 
a large proportion of research and 
theoretical development has been in 
the field of learning. Like the crite- 
rion problem, a great deal is said about 
the advisability of studying training, 
but relatively little has been done. 

There are some studies of the ef- 
fectiveness of training showing posi- 
tive results (Maier, 1953; McGehee 
& Livingston, 1954; Wallace & Twit- 
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chell, 1953) and some showing no ef- 
fect (Baxter, Taaffe, & Hughes, 
1953); again the criterion problem is 
an ever-present difficulty. Several 
studies have referred the effect of 
treatment back to a somewhat vague 
determinant known as the ‘‘climate”’ 
of the group or organization (Buchele, 
1955; Fleishman, 1953; Jennings, 
1955). In the absence of precise defi- 
nition of the variable or the way in 
which it operates, this has not proved 
fruitful, although it probably points 
to something which must be dealt 


with later. Certainly much of our 
training in industry takes place 
within a classroom, is measured 


there, and probably never shows an 
effect on the workroom floor (Fleish- 
man, 1953). 

Edgerton (1955) suggests some 
solid advances in the field—‘watch 
and do” training films, the Air Force 
work on team training, the Navy’s 
work on where training devices are 
needed. In addition, the growth of 
the National Training Laboratory 
and the Western Training Labora- 
tory have focused the group dynami- 
cists’ interest on the problem. Still, 
there is very little in the way of 
evaluation of the effectiveness of 
training. 

There are a number of exhortatory 
articles urging an evaluation of train- 
ing but relatively few comparable to 
Edgerton’s in detail of review and re- 
search suggestion. He proposes a de- 
tailed research program including 
measurement of predictor abilities, 
variation of training methods, and 
criterion measures. In this way it is 
somewhat similar to Cronbach’s pro- 
posal (1957) that a matrix containing 
both treatments (training methods) 
and aptitudes is needed and that the 
interaction factors may be maximally 
effective. It should be pointed out 
that the approach implied in these 
two suggestions is, to some extent, at 
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odds with the pure correlationists’s 
approach outlined earlier and is a 
step toward a rapprochement with 
the experimentalist. This combina- 
tion of treatment and individual dif- 
ferences has not been explored in 
terms of job design and equipment, 
but there is no reason why it should 
not be. 

For example, micromotion tech- 
niques are well worked out for study- 
ing job performance. A detailed 
study of the way work is done by in- 
dividuals high on the predictor and 
low on the criterion (or vice versa) 
might yield real possibilities for the 
redesign of performance leading to 
the productive criterion. Particu- 
larly when the ability required is 
scarce, such a procedure would be 
useful. In terms of the national pro- 
ductivity, a national sample of, for 
instance, psychomotor skills and the 
redesign of jobs to take advantage of 
them would seem both feasible and 
fruitful. 

The problem of training cuts across 
the selection problem more directly. 
Most validation studies are con- 
ducted against a fixed training crite- 
rion, i.e., the simple pass-fail criterion. 
However, we have little research on 
the prediction of trainability in terms 
of the correlation between a selector 
and the slope of the learning curve. 
In many cases a lower slope and a 
higher eventual level seems a real 
possibility, and for long employment 
the prediction of both slope and even- 
tual improvement seems important. 
Again, most validities are against 
proficiency criteria taken very early 
in job-life compared to the typical 
tenure of a producer. There is some 
evidence that validities and intercor- 
relations taken at different periods of 
employment would give very differ- 
ent weightings in a predictive bat- 
tery, and Fleishman and Hempel 
(1954) have suggested a change in the 
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factorial content of motor skills with 
practice. All of these points seem to 
suggest that in addition to differenti- 
ation of the predicted criterion in 
terms of job-relevant factors the 
elaboration of its psychological com- 
ponents might also be fruitful. 

In practical terms, the interview is 
still the most widely used selective 
device. Some progress has been 
made in the development of pat- 
terned interviews, biographical data 
blanks, and job knowledge tests to 
support the interviewer, but both be- 
cause of its wide use and relatively 
little development, this area seems 
particularly r°ve for research. Since 
Meehl’s distinction between clinical 
and actuarial approaches, the direc- 
tion of development in terms of a 
validation of the interviewer himself 
on an actuarial basis seems the like- 
liest to bear fruit. The study of the 
interviewer and validating him as an 
instrument are surprisingly lacking in 
published research. 


INDUSTRIAL SOCIAL PSYCHOLOGY 


As has been suggested above, it is 
harder to give a compact historical 
background for this subarea than for 


the others. An historical review has 
recently been given elsewhere (Haire, 
1954) and will not be repeated here. 
Likewise, it is harder in this area to 
give the central theme that charac- 
terizes the field. Several of the 
themes must be identified and will be 
dealt with in detail below. The first 
of them is the interest in group proc- 
esses flowing from the impact of 
Lewin, Moreno, and the Mayo school. 
These include the interest in socio- 
metric structure of the group, roles, 
resistance to change, small group 
dynamics, and the social organiza- 
tion of the factory. The second is an 
interest in “the psychology of the 
other one’”’ in the verstehen tradition, 
with an emphasis on an approach 
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through perception and a detailed 
understanding of the world of experi- 
ence of the subject. A third is the 
presence of a broad humanistic value 
which seems to run through the field. 
It is often not explicit, but lurks im- 
mediately below the surface. For ex- 
ample, the “human relations’’ point 
of view often seems to suggest that an 
increase in need satisfactions at work 
will increase productivity (Brayfield 
& Crockett, 1955); often, however, it 
seems as if the suggestion is that even 
though it may not increase produc- 
tivity, an increase in need satisfac- 
tions for the worker is a social good. 
Again, Argyris (1957) and others 
(Katz & Kahn, 1952) seem to imply a 
similar calculus: it is possible and ad- 
visable to reduce the achievement of 
organizational goals in order to in- 
crease the achievement of individual 
need satisfactions. A fourth stream in 
the field is a shifting in the manner of 
dealing with motivation. Taylorism 
and the vogue of incentive pay sys- 


tems is closely related to the 18th 


century economic man. With more 
sophisticated psychological theory 
of motivation this interpretation 
gives way to one in which the drives 
are both more differentiated and in- 
ternalized. Katz and Kahn (1952) 
emphasize the internalization of re- 
ward. The differentiation of motives 
is more complex. In a sense the idea 
of economic man gave way to a re- 
fined instinct theory. Veblen’s Jn- 
stinct of Workmanship (1918), Tead’s 
Instincts in Industry (1918) and Wil- 
liams’ Mainsprings of Man (1925) 
are early examples of the transitional 
state. Brayfield and Crockett (1955) 
point to, and Kornhauser and Sharp 
(1932) illustrate, the period when 
the instinct argument was being 
fragmented into complex motiva- 
tional analyses and attitude surveys. 
With motivational sophistication, the 
complexities of modern theory ap- 
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pear, with, for example, Maier’s em- 
phasis on frustration (1955), Stag- 
ner’s interest in the conflict of dual 
allegiance (1957) and the somewhat 
Freudian analysis of McGregor and 
others (1944). 

External influences have been par- 
ticularly pressing in this area. The 
growth in size of industrial organiza- 
tions has tended to destroy mana- 
gerial reliance on old face-to-face 
relationships and to force a consid- 
eration of small group pressures and 
of formal characteristics of large 
organizations. The influence of ra- 
tionalization of jobs has been men- 
tioned; a reduction of emphasis on 
individual skill raises problems re- 
lated to the kind of satisfactions one 
finds at work. It has also cut across 
the problem of group structure; for 
example, the disappearing role of the 
foreman is partly due to the changing 
organization of skills and responsi- 
bility. Collective bargaining has 
introduced new group problems of 
primary allegiances, and a rich field 
for attitude study as well as the in- 
process study of the bargaining itself. 
The union and its structure has be- 
come a subject for study, quite apart 
from its relation to management. In 
addition to these there are some 
broad social currents related to the 
developments in industry. One of 
these is the increasing profession- 
alization of management which means 
more than a simple specialization of 
function. It is the development of a 
group of specialists not primarily re- 
lated to the product, or, indeed, to 
production, but to the administra- 
tion of large complex social organiza- 
tions and, especially, to the problems 
of dealing with people within them. 
Partly out of this has grown an in- 
terest in a proper philosophy of man- 
agement—an asking of “why” in- 
stead of the traditional ‘‘how”’ which 
should lead the field to the broader 
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problems of social philosophy. As the 
corporation grew, social psychologi- 
cal problems arose with increase in 
its size. In addition, however, addi- 
tional problems arose from the fact 
that the legal entity of the corpora- 
tion and its traditional forms of or- 
ganization were frozen when size, 
communication technology, distribu- 
tion technology, and automation of 
production were entirely different. 
In many ways, industry has been less 
flexible in adapting its organization 
to relevant technological changes in 
administration than other large social 
organizations such as the Armed 
Services. This inflexibility creates 
problems of its own. Still another 
general social interest arising out of 
industrial organization is the current 
concern with conformity and the 
Organization Man (Whyte, 1956) 
which makes contact with, but did 
not seem to initiate, the psychological 
research now going on in the field 
(Asch, 1956; Crutchfield, 1955; Tud- 
denham, 1958). Finally, a whole 
group of basic societal values related 
to industry seem to have shifted, and 
to have had an influence on the gen- 
eral trends of interest in industrial 
social psychology. It used to be said 
that the United States was a success- 
oriented culture, with values seated 
in productivity, industriousness, and 
achievement. It is perhaps not ex- 
cessive to say that it is becoming an 
adjustment-oriented culture, with 
values stemming from fitting in with 
the group. We find contradictory 
norms of conformity and inconspicu- 
ousness, on the one hand, inhibiting 
the individual from the singleminded 
following of his own path, and, on the 
other hand, a rejection of industrious- 
ness as compulsive and achievement- 
oriented. In this connection, it is 
often said, jokingly, that the Ford 
Motor Co. has grown so that if “‘old 
Henry” were alive today there would 


179 


be no place for him in the organiza- 
tion. It is probably true, and his in- 
ability to fit in perhaps would stem 
from both of the values mentioned. 
Some of these social values—toward 
conformity and _ industriousness— 
have been due to the influence of 
psychological thinking; some of the 
psychological thinking about indus- 
trial problems has, in turn, been influ- 
enced by them. In any case, the field 
clearly feels the impact of these his- 
torical developments within industry 
and society. 


Large Group Organization 


In dealing with the problems re- 
lated to group processes and struc- 
tures, it is probably convenient to 
differentiate between large groups 
and small groups. Although there is 
really no fixed borderline between 
them, divergent interest seems clear. 
Historically, the interest in large 
group organization in industrial so- 
cial psychology stems from the view- 
point of Elton Mayo, and particu- 
larly from the major report of 
Roethlisberger and Dickson (1939). 
Mayo’s insistence that ‘‘man’s desire 
to be continuously associated ... 
with his fellows is a strong, if not the 
strongest human _ characteristic’ 
(Viteles, 1953, p. 181) leads off in the 
direction of social motives and small 
group processes, but his interest “‘in 
social organization of the factory” 
began an emphasis on the industrial 
organization as an autonomous mac- 
rocosmos. Within it there is an in- 
terest in role problems and motiva- 
tions for broad functionally defined 
classes, e.g., the white collar worker 
(Mills, 1956), and in stresses within 
the organization such as the relation 
between staff and line (McGregor, 
1948). The hierarchical character of 
the organization appears in connec- 
tion with prestige and status (Bar- 
nard, 1946), though, surprisingly, it 
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has not extended into the studies of 
communication, where one might ex- 
pect to find hierarchical barriers to 
the flow of information within the 
group. In contrast to the formal or- 
ganization, the informal organization 
which develops spontaneously out of 
the group itself has been seen as a 
source for individual need satisfac- 
tions denied by the formal structure 
(Selznick, 1958). The informal or- 
ganization is also the basis for several 
studies of rumor and communica- 
tion (Back, Festinger, Hymovitch, 
Kelly, Schacter, & Thibaut, 1950; 
Festinger, 1950), and for its role in 
morale (Arensberg & McGregor, 
1942). 

In dealing with the large group, the 
field of organization theory has 
shown great activity in recent years 
(Argyris, 1957; Bakke: 1950, 1953; 
Bakke & Argyris, 1954; Barnard, 
1950; Haire, 1955b; Herbst, 1957; 
Simon: 1952a, 1952b, 1955; Weiss, 
1956). The most frequent single 
thread running through this material 
is the conflict between the organiza- 
tion’s goal and the satisfaction of the 
individual’s motives. Is it possible to 
have unity of direction in an organ- 
ization without sacrificing autonomy 
in the individual? Is it possible to 
have an hierarchical chain of com- 
mand without sacrificing egoistic 
need satisfactions in the lower rungs? 
Is it possible to have planned ration- 
alized production without sacrificing 
active independence? McGregor 
(1944) suggests that it is. Argyris 
(1957), following a similar line, sug- 
gests that it is, but only at the cost 
of the organization’s objectives. It is 
not perfectly clear in Argyris’s treat- 
ment whether, on the one hand, he 
is dealing with the large organiza- 
tion, and the impeding effects of pol- 
icies, rules, roles, and formal nets, or, 
on the other hand, with the more 
intimate geography of the small 
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groups, and what Walker and Guest 
(1952) call ‘‘mass production as a 
code of law.”’ Barnard (1950) and 
Selznick (1958) both find some solu- 
tion in the existence of the informal 
organization to solve this problem. 
Simon (1955) and the decision the- 
orists in general tend to disregard the 
conflict, assuming a rational man, in 
many ways a more sophisticated 
counterpart of the old-fashioned eco- 
nomic man. This man now maxi- 
mizes strategies by relying on sub- 
jective evaluations and probabilities, 
but the original utility notion is not 
far submerged. 

This conflict between the individ- 
ual’s goals and the organization’s is 
part of the emphasis on broad hu- 
manistic values mentioned above. It 
often urged that the individual 
should be provided with more moti- 
vational satisfaction. Sometimes it is 
explicitly held that an increase in his 
satisfactions will make a more effec- 
tive producer. Sometimes it seems 
to be implicit that more satisfaction 
for the individual would be a Good 
Thing in any case; this is particularly 
true in dealing with socially valued 
satisfactions such as self-actualiza- 
tion, autonomy, and the like. In 
dealing with the industrial situation, 
this problem, which reappears in 
dealing with attitudes and motiva- 
tion, remains unsolved. Under the 
heading of organization it is sug- 
gested that the organization be op- 
erated in such a way that the indi- 
vidual’s and the _ organization’s 
achievement of goals be maximized 
simultaneously. If this means that 
there is some ideal form of organiza- 
tion which can simultaneously bring 
about the absolute maxima of both 
at the same time it is never explicitly 
stated. If it means, as it often seems 
to, that some of the organization’s 
goals should be sacrificed to increase 
the individual’s, the calculus is never 

















made specific, nor the social philos- 
ophy on which it rests made explicit. 
In current approaches, the other pos- 
sibility—that the individual's goals 
should be further sacrificed to in- 
crease the organization’s—seldom 
seems even to lie beneath the surface. 
This is probably because in the last 
20 years we have been in a very pro- 
ductive economy with relatively full 
employment, in which attention is 
directed less toward additional ma- 
terial output than to human values. 
Whatever the reason, the social psy- 
chologist unwittingly becomes a so- 
cia! philosopher as he chooses values 
underlying his analyses; the issues in 
the field would be clearer if these so- 
cial philosophies were developed and 
made explicit. 

Organization interest has turned 
to the relation between structure and 
function (Haire, 1955b; Weiss, 1956), 
and Herbst (1957) has reported some 
data applying an input-output anal- 
ysis to the description of social or- 
ganizations. In many ways the most 
surprising thing in,the field of or- 
ganization theory is the paucity of 
empirical data. For example, there 
seem to be no empirical histories of 
the growth of organizations in terms 
relevant to the social psychologist. 
In industry and economics there are 
a host of reports on growth and or- 
ganization in terms of invested capi- 
tal, dollar volume, and the like, but 
it is hard to find a history in terms of 
the number of people, what their 
functions were, their relations with 
one another, and the relative growth- 
rates of the parts and the whole. 
Such studies seem an essential base 
for organization theories if they are 
ever to move beyond the kind of pos- 
sibility-spinning that characterizes 
them today. 

The interest in the organization of 
large groups in industry has opened 
one area which seems to have par- 
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ticular promise for future develop- 
ment in the general psychological 
theory of motivation, and for a 
fruitful interaction between psycho- 
logical and sociological approaches. 
Several recent studies seem to be ap- 
proaching a kind of geographical ecol- 
ogy of motivation at work. Walker 
and Guest (1952) made a detailed 
interview study of workers on an as- 
sembly line. They found, for ex- 
ample, that the number of contacts a 
worker had with his fellows at work 
was not only related to his expressed 
satisfactions, but was also related to 
such company-relevant indices as 
turnover, grievances, and the amount 
of pay necessary to keep a man on 
the job. The assembly line almost 
necessarily is stretched out in a long 
thin line, and the very geography 
of production, originally designed 
with only technological merits in 
mind, is discovered to have liabilities 
in human performance. Similarly, in 
England, when technological ad- 
vances made it profitable to mine 
thin coal seams by a “long wall” 
method, the strength of both small 
face-to-face groups and of the larger 
inclusive group was weakened, with 
an increase in accidents and a de- 
crease in production. In the new sys- 
tem, instead of concentrating a small 
group at a seam-face to do a com- 
plete job of loosening, gathering, and 
transporting coal, a shift attacked a 
long seam, loosening coal for an en- 
tire period, after which another shift 
gathered and _ transported. This 
change so weakened the structure of 
the group (with loss to the company) 
that it was necessary to find other 
ways to rebuild the small group rela- 
tions with the larger organization. 
When this was done, accidents went 
down and _ productivity went up 
(Trist & Bamforth, 1951). Studies 
like these lead us to see the machine 
and machine-layout less as a tool for 
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production than as a part of the 
topography of the individual's work 
place, and, as such, provide a way to 
study the geographical ecology of 
motivation. They seem surprisingly 
similar to the findings of the urban 
sociologists as they investigate the 
change in social organization flowing 
from the change from the wheel- 
spoke towns of the railroad days to 
the modern strip-towns growing up 
along the highways. So far the two 
fields have not come together on the 
problem, but it seems profitable for 
them to do so. At the same time, it 
is worth noticing that such studies 
have implications for the human en- 
gineer’s analysis of work-systems, 
and the geographical ecology of 
motivation adds another dimension 
to the human variables with which he 
should deal in the design of work for 
the optimum utilization of human 
characteristics. 

Jaques’ Changing Culture of a Fac- 
tory (1952) brings up the same kind of 
problem, though his basis of analysis 
is more topological than geographi- 
cal. Similarly, some of the studies of 
rumor and communication demand 
a kind of grid-system against which 
to see the group processes. Davis 
(1953), in studying “grapevine” com- 
munication systems in industry, 
shows isolates in the network, both 
on the basis of geographical position 
and functional position in the group. 
Weiss and Jacobsen (1955), using a 
very large sociometric matrix of com- 
munication contacts, similarly identi- 
fied both isolates and liaison people 
within the group. In nonindustrial 
studies of rumor transmission (Back, 
et al., 1950) both geographical and 
functional position’ assume impor- 
tance in communication. Further 
study of the layout of the social 
group seems promising for such proc- 
ess analyses as well as for the ecology 
of motivation. 
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Small Groups 


In the interest in smaller groups, 
two lines are evident: the structural, 
which might be seen flowing from 
Moreno’s sociometry, and that of 
group dynamics, stemming from Le- 
win. The Lewinians raise the prob- 
lem of group cohesiveness, though the 
problem has not proven particularly 
amenable to attack either theoreti- 
cally or experimentally. They have 
focused considerable attention on 
group problem solving (Kelley & 
Thibaut, 1955) and group decision, 
and it is from the latter area that the 
biggest influence on industrial studies 
has come. Referring back frequently 
to Allport’s article on participation 
(1945), and, in the most detailed 
study of the group, drawing heavily 
on Lewin’s notion of quasi-stationary 
equilibrium (Coch & French, 1948), 
they have produced a group of in- 
stances in industry where participa- 
tive group decision overcomes re- 
sistance to change. While there is 
considerable evidence that participa- 
tion is effective, we have very little 
suggestion as to why it is. Somehow, 
this technique seems able to muster 
the forces which hold the group to- 
gether, and which often are barriers 
to the group’s action, and make them 
positive. Historically, the interest 
in small groups was refocused by the 
Hawthorne studies’ discovery of the 
effectiveness of the group in deter- 
mining productivity. They began, in 
the best engineering tradition, by 
asking, “will the introduction of rest 
pauses reduce fatigue and monotony 
and hence increase output?” Later, 
in an outburst of serendipity, they 
say, ‘it was clear that two essentially 
different sorts of changes occurred 
. .. those changes introduced by the 
investigators in the form of experi- 
mental conditions ...and a gradual 
change in the social interrelations 
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among the operators themselves and 
between the operators and the super- 
visors. From the attempt to set the 
proper conditions for the experiment, 
there arose indirectly a change in hu- 
man relations which came to be of 
great significance’ (Roethlisberger & 
Dickson, 1939, pp. 58-59). The new 
interest in the group’s power to fa- 
cilitate or inhibit change stems from 
this point. 

The field of resistance to change is 
an old one in industrial social psy- 
chology. In 1927 Angles published a 
detailed study of this topic and 
Mathewson devoted a book to it in 
1931, but its base in group processes 
was not clear until it was suggested 
in the Hawthorne studies and vigor- 
ously followed up by the Lewinians. 
Angles, for example, spoke of the 
signs of restriction of output and of 
its causes. He identified the reduc- 
tion in the variance of production 
figures as an indicator of restriction 
20 years before Lewin’s hypothesis 
of gradients of forces around levels of 
performance made it theoretically 
meaningful. Angles interprets re- 
striction chiefly in terms of individual 
factors, speaking of such things as 
fear of rate cutting, physiological 
factors, satisfaction with present 
earnings, and the like. He does men- 
tion (Angles, 1927, p. 250) that ‘“‘the 
practice seems to rest primarily on a 
strong sense of courtesy to one’s 
mate...’ and later, “loyalty to 
one’s work-mate is usually so strong 
that a higher position with better 
wages in the same factory is not de- 
sired if it would mean separation 
from one’s set of neighbors and 
friends. This is more marked in 
women than in men.... The herd 
instinct seems to operate in inverse 
ratio to the skill required for the 
work, and thus it quite commonly 
overcomes natural acquisitiveness.”’ 
Two things seem interesting in this 
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early quote: one, the fact that the 
force associated with the relations 
among workers was identified as part 
of the restriction of output and re- 
sistance to change, though it was not 
emphasized, and, two, the fact that 
it was still permissible to speak in 
instinct terms; the more modern 
terminology of social motives and 
restraining forces had not yet taken 
over. We seem clearly to have pro- 
gressed in the more modern state- 
ments. It is now possible to identify 
and manipulate some of the factors in 
the group. However, further work on 
group processes outside the industrial 
field should lead us to more precise 
determinations of the source and na- 
ture of the problem at work. 

In the industrial situation, the area 
of restriction of output is a somewhat 
strange case in which the very phe- 
nomenon has changed, partly be- 
cause of its recognition and study, 
and partly because of a change in the 
environment in which it operates. 
Where originally it seemed firmly 
grounded in group cohesiveness and 
social motives, it has become appar- 
ently a much more conscious tool of 
bargaining. While at all times there 
seemed to be an element of fear of 
rate change, and a protest against 
what were perceived as too high rates 
of work, more recently the restriction 
of output seems to be used as a de- 
liberate tool preliminary to bargain- 
ing for something else. For example, 
in the building trades, the railway 
workers, and longshoremen restric- 
tion of output has led to fixed work 
schedules—either shorter work-weeks 
or schedules based on units of produc- 
tion, with overtime pay for what 
would be normal productivity before 
restriction. Although the overt be- 
havioral signs may remain the same, 
the psychological context underlying 
them may have changed radically to 
a more conscious use of the group’s 
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power to gain a leverage in a bargain- 
ing situation. 

The small group theorists have also 
markedly changed the interpretation 
of leadership. Particularly in the in- 
dustrial setting, the traditional view 
of the leader in the past was of a 
charismatic individual possessing the 
trait of leadership. A great deal of re- 
search went, without much success, 
into attempts to identify these qual- 
ities of leadership in the interest of 
selection. The emphasis on the group 
and group processes led to a descrip- 
tion of the leader in the mold of The 
Admirable Crichton—a man who, in 
the particular situation, possessed the 
skills and abilities to provide means 
for the satisfactions of the needs of 
group members. We now speak of 
the ‘‘emergent’’ leader, we distin- 
guish between ‘‘headship”’ and ‘‘lead- 
ership,’ and we use “buddy ratings”’ 
to identify leaders. The pendulum 
has swung a long way to one side, 
and the reverse trend is already dis- 
cernible in the assessment field, in a 
return to the search for the qualities 
of leadership within the individual 
instead of the group. However, even 
as the pendulum swings back, it is 
more group-oriented, and the vari- 
ables tend to deal with relations with 
others rather than decisiveness, force- 
fulness, and determination. For in- 
dustrial studies of leadership, the 
problem will surely have to be re- 
ferred to the growing body of studies 
of organization theory. For example, 
as the size of the group increases, the 
simple ability to hold the group to- 
gether becomes one of the most im- 
portant problems, and, in many 
cases, a corporate entity seems to di- 
rect itself a large proportion of the 
time, with the leader’s job being to 
keep it working as a unit, to keep in- 
formation flowing through it, and to 
adjust the parts to one another. 
Again, as the study of group process 
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progresses, particularly as it moves 
toward organization theory, the re- 
statement of the problem of leader- 
ship seems a likely fruitful outcome. 

In the tradition of the sociometric 
analysis of the internal structure of 
groups, there is a lively development. 
Danzig and Galanter (1955) and 
Weiss and Jacobsen (1955) have 
proposed sophisticated sociometric 
analyses of industrial groups and dem- 
onstrated their applicability empir- 
ically. The techniques are computa- 
tionally cumbersome, but they are 
encouraging in the indication that it 
is possible to work with groups of at 
least two or three hundred, and that 
fruitful results may come from such 
analyses. The first of these studies 
provides operational meaningfulness 
to the concepts of cohesiveness, social 
distance, and centrality, using the 
terms somewhat as Bavelas (1948) 
did in dealing with communications 
nets. Developed further, this would 
mean that it was possible to define all 
the-radii of an industrial group at a 
given point in time, and, hence, to 
describe the shape of the organiza- 
tion in a much more functional man- 
ner than in the traditional “family 
tree’ organization chart. If it were 
possible thus to represent the shape 
of the organization it would allow us 
to do a kind of longitudinal study 
which has been so far unavailable to 
social psychologists. The anthropo- 
metric studies in the area of child de- 
velopment have been fruitful sources 
of understanding. A similar historical 
picture of the growth of social or- 
ganizations would seem equally 
promising if the means of represent- 
ing shape and structure at various 
stages becomes available. The Weiss 
and Jacobsen study points to an- 
other similar possibility. They iden- 
tified subgroups on the basis of the 
kinds of contacts which occurred— 
separated work groups, liaison peo- 











PROBLEMS RELEVANT TO BUSINESS AND INDUSTRY 


ple, and isolates. Of their group of 
200, about 18% were tentatively 
identified as having a liaison func- 
tion. Again, this kind of analysis 
could provide a measure for the 
growth of organizations with which 
we could see the rate of demand for 
liaison as the size increases. Such 
studies could give us insight into the 
stresses within the system arising as 
a result of growth. 

Also in the sociometric tradition, 
although a long way from its origin, 
would be studies like Cartwright and 
Harary’s (1956) use of graph theory 
to deal with symmetry and asym- 
metry in attitudes within the struc- 
ture. French (1956) uses a similar 
approach to deal with social power. 
Such sociometric analyses have been 
urged as a management tool, or, as 
the sociometrist develops toward the 
study of interaction process, as a pre- 
dictive device (Chapple, 1953). The 
study of group structure through 
interaction process analysis has de- 
veloped both in technique and theo- 
retical statement (Bales, Flood, & 
Householder, 1952). It is still cum- 
bersome, but in some ways it is the 
natural bridge between the static 
structuralism of the pure sociometric 
tradition and the dynamics of the 
Lewinians. A special variety of the 
study of interaction structures within 
groups is the field of communication 
nets and their relative effectiveness. 
Most of the work stems originally 
from Bavelas’s model (1948) and 
there has been a certain amount of 
sameness in both the form and find- 
ings of the empirical research 
(Bavelas, 1950; Heise & Miller, 1951; 
Leavitt, 1951; Leavitt & Mueller, 
1951; Shaw, 1955; Shaw & Roths- 
child, 1956). However, in view of the 
central role of communication and 
cummunications nets in the viability 
of industrial organizations, this must 
be seen as a point from which re- 
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search progress will flow. 

One other area of small group prob- 
lems seems particularly relevant to 
economic behavior, although it is not 
possible to point to any considerable 
body of published work at present. 
Economic theory has always included 
hypotheses about both human moti- 
vation and interactions between peo- 
ple. Recent developments, following 
historically, though not necessarily 
logically, from the theory of games, 
contain two possibilities especially 
relevant for small group theory and 
research: utility curves for shared 
risk, and variable information in stra- 
tegic decision matrices. It is not pos- 
sible to do more than point to the ex- 
istence of the problems here. It is 
possible to construct psychological 
curves (rather than strictly logical) 
for the risk an individual will run to 
gain a given objective. Such a func- 
tion contains a multitude of factors, 
expectations, risk-taking proclivities, 
present position, and the like. The 
empirical curve is typically not mon- 
otonic. Such risks can be experi- 
mentally shared, in the fashion of 
pooled insurance risks, but it is not 
necessarily true that the new value 
will fall at the point indicated by di- 
viding the risk and return by the 
number of sharers. The field suggests 
possibilities for a joint study of group 
structure and process and the eco- 
nomic area of risk-taking. On the 
other hand, strategic decision theory 
tends to rely on complete information 
or complete lack of it among partici- 
pants in competitive situations; a 
situation which facilitates model 
building but does not approximate 
reality very closely. Again, it seems 
possible to bring to bear the tech- 
niques of the communication-net re- 
search and the work on cohesiveness 
in groups to add a realistic dimension 
to empirical studies of strategic deci- 
sions. 
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Motivation 

In many ways the history of the at- 
tempt to deal with motivation in in- 
dustrial social psychology covers a 
large part of the whole field. Selec- 
tion devices in the strict tradition of 
individual differences were soon seen 
to be inadequate on the simple level 
of skills and ability. Motivations 
were introduced here as individual 
variables in interest inventories like 
the Strong and Kuder. As the 
individual came to be studied in the 
context of the industrial organiza- 
tion, stress was put on higher or- 
der needs—for example, loss of so- 
cial need satisfaction on the one 
hand through mishandling of group 
structure, and of egoistic need satis- 
faction on the other through de-skill- 
ing in the rationalization of jobs. 
Surveys of attitudes moved the moti- 
vational interest from the level of the 
individual to the group average. 
Questions were still asked individ- 
ually, but the results, in terms of 
both the responses and the criterion 
(usually production), were from the 
total group. With somewhat dis- 
couraging results in a simple relation- 
ship between attitudes and criterial 
performance, interest shifted to 
studies of attitude change, and the 
“human relations’ movement is 
largely a program of development of 
attitudes among leaders to manipu- 
late the ‘‘climate”’ seen as a variable 
in early group studies. Finally, it is 
largely the interest in motivation 
that shapes psychological studies of 
organizatie”, perhaps stemming from 
the early Lewinian suggestion that it 
is easier to reorganize the structure 
of the whole group than to change 
the individual and leave the group as 
it was. Within this sketchy overview 
lies a large proportion of the re- 
searches in industrial social psychol- 
ogy, a tremendous body of relatively 
unfruitful work on attitudes, and 
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some promises for future progress. 

In general, the studies of attitudes 
and productivity stem from the law 
of effect notion in learning theory: or- 
ganisms tend to seek out situations 
that are rewarding and avoid those 
that are punishing. Katz and Kahn 
(1952) make this explicit for human 
relations, Haire (1957b) for indus- 
trial leadership, and Brayfield and 
Crockett (1955) for attitude studies. 
In hoping to predict high perform- 
ance from positive attitudes, the rele- 
vance of the principle is not perfectly 
clear, and, indeed, a summary of a 
host of empirical investigations would 
suggest that a positive relation be- 
tween the two is tenuous at best. On 
the positive side, the law of effect 
would involve the worker finding 
satisfactions on the job, and these 
satisfactions would be revealed by 
appropriate attitude surveys. It is 
not necessarily true, however, that 
the pursuit of these satisfactions 
would lead to productivity. On the 
negative side, lack of satisfactions 
might be expected to lead to avoid- 
ance of the job-situation, and, since 
presence is a necessary condition for 
productivity, to reduced criterial per- 
formance. Indeed, some studies do 
suggest a relation between negative 
attitudes and withdrawal in the form 
of turnover, accidents, and absen- 
teeism (Brayfield & Crockett, 1955). 
On the positive side, however, we 
seem to need considerably more dif- 
ferentiation of motivational theory 
to serve as an intervening variable 
between surveyed attitudes and ob- 
served productivity, and better co- 
ordinating definition between the 
motivational constructs and the re- 
alities of the industrial situation. 

A step in this direction is found in 
motivational analyses. The Michi- 
gan studies of productivity and mo- 
rale (Katz, Maccoby, Gurin, & Floor, 
1951) for example, introduce inter- 




















PROBLEMS RELEVANT TO 


vening steps of group-oriented versus 
task-oriented supervision between 
attitude and the criterion. Without 
this kind of differentiation, the meth- 
odological elegance of scaling and 
the development of subscales in atti- 
tude surveys seems likely to be of 
little use in the future on this prob- 
lem, as it has been in the past. The 
studies of Walker and Guest, referred 
to above, broke attitudes down into 
social satisfactions associated with 
contacts on the job, egoistic satisfac- 
tions associated with feeling of 
achievement at work, and the like. 
While these studies did not relate the 
findings to criterial performance, 
they provide a step toward it. Sim- 
ilarly, a group of other studies (Katz 
& Kahn, 1954; Schaffer, 1953; Wick- 
ert, 1951) suggest the possibility of 
identifying and producing egoistic 
satisfactions at work, and, in some 
cases, of relating them to criterial 
performance. 

The assumption that high levels of 
satisfaction of the individual’s needs 
will be related to productivity does 
not seem to hold. Put in these terms 
it is not surprising; there is no reason 
to believe that the individual’s goals 
and the organization’s will coincide. 
A more detailed understanding of the 
kinds of satisfactions determining be- 
havior at work (productive and 
other) and, particularly, the ecology 
of drives at work suggested above 
would seem to be needed before sim- 
ple relationships between morale and 
productivity can be hoped for. 

One area of motivational studies is 
surprisingly lacking—the study of 
motivation of management. While 
psychologists have inveighed vigor- 
ously against the oversimplifications 
of the Economic Man as a motiva- 
tional model for workers, I suspect 
that we have largely kept it in inter- 
preting management behavior. The 
manager is seen generally as actuated 
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by money and power as motives, al- 
most as if there were a difference in 
kind between superiors and sub- 
ordinates. A broad program of in- 
vestigation in this area would seem 
worthwhile; it would also tie in well 
with the assessment approaches to 
the identification of high level talent. 
As it develops, such an interest would 
go beyond the simple motivational 
interpretation of behavior, and, per- 
haps, lead us to a statement of what 
might be called a philosophy of man- 
agement. Now, when the corpora- 
tion is taking on a very different role 
in the national social structure, and 
when the professional manager is a 
dominant figure in the corporation, 
the general value systems in which.he 
operates become of especial impor- 
tance. The other side of this coin, in 
a sense, is the social role of the man- 
ager. The American manager has a 
quite different role in the view of the 
public from the position of the man- 
ager in most of the other Western 
countries. Indeed, much of the 
strength of American industry seems 
to flow from this. It would seem an 
appropriate area for study to investi- 
gate this closely. Such work would 
be closely related to the companion 
studies of role perceptions of man- 
agers of themselves and of their posi- 
tions, and of the influence of hier- 
archical levels on self-perception. 

It seems somewhat strange that 
motivational analysis has not in- 
cluded the various levels of manage- 
ment. Studies of fatigue have con- 
centrated on the worker, whose hours 
are typically much less than the man- 
ager. The emphasis on social and 
egoistic need satisfactions at work 
have been primarily on the hourly 
paid worker, as if he alone had the 
sensibilities to avoid the rigid deter- 
minism of economic motivation. 
Some of the role problems of the fore- 
man in complex structures have been 
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pointed to (Ghiselli & Lodahl, 1958; 
Porter, 1959), though there is little 
research, but there is virtually no 
work in psychology on prestige and 
status as springs of action in the man- 
agement structure. Our picture of 
the problems of human motivation 
and organizational goals will never be 
complete without an analysis of the 
directing part of the structure as well 
as the larger portion of the work 
force. 


Communication 


The general problem of communi- 
cations has long been a part of indus- 
trial psychology. Early it was largely 
stimulus bound. The human engi- 
neering work on the size of type is in 
this tradition, and, to a large extent, 
the Flesch count is a modernization 
of this original question, substituting 
length of sentences for size of type. 
Early work in advertising, too, gen- 
erally followed this tradition, con- 
centrating on stimulus characteristics 
designed to attract attention.2 More 
recently, work on communication 
has turned to structural factors in 
grqups, mentioned above, and to a 
more detailed study of the process of 
communication. 

As the problem moves from the 
simple one of stimulus presentation 
and a criterion of recall or failure to 
recall, the details of the process be- 
come important. In industry itself, 
some studies have suggested that cer- 
tain kinds of information can be dif- 
ferentially utilized by workers (Chis- 
holm, 1955). Hovland and co-work- 
ers (Hovland, 1957; Hovland, Janis, 
& Kelly, 1953) have considerably 


2 This paper does not go into the consider- 
able body of work which has recently been 
done in the field of market research and con- 
sumer behavior. It could go appropriately 
here, but it is covered in another paper in this 
series, by Paul Lazarsfeld, in The American 
Journal of Sociology. 





MASON HAIRE 


differentiated and enlarged the stim- 
ulus problem. They raise the problem 
of primacy, tieing the problem of 
mediating attitude change to more 
traditional learning studies. Hovland 
also extends earlier work on the pres- 
tige of the communicator by varying 
source credibility, and opens the sub- 
ject’s motivational system by study- 
ing the effects of fear-arousing prop- 
aganda. The Yale studies also en- 
larged the criterion problem, study- 
ing the process as well as the fact of 
retention by observing changes in 
material over a period of time. Fur- 
ther internalization of the process is 
indicated by the suggestion that 
identifiable steps of identification, in- 
ternalization, and compliance can 
fruitfully be recognized in the sub- 
ject. Cartwright (1949) made some 
detailed suggestions about the condi- 
tions necessary for communication to 
produce a change in attitude. An- 
other facet of the process has been 
opened by Festinger’s (1950) inquiry 
into the motives for communication. 
Most of these developments have 
been in relatively pure laboratory 
situations rather than in industry. 
However, the variables seem imme- 
diately relevant to present industrial 
problems, and the progress in stating 
the problem promises future growth 
in the attack on communication in 
industry. 

The rise in the field of social per- 
ception probably deserves to be in- 
cluded under the general heading of 
communication, since it deals with 
variables influencing the reception 
of stimulus information. The term 
“social perception” seems to have 
two distinct meanings (part from the 
third usage which Brunswik gave it): 
on the one hand, the influence of the 
social group on the process of percep- 
tion, stemming from the Sherif ex- 
periments on autokinetic movement, 
of which the conformity studies al- 
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ready mentioned are ideological de- 
scendants. The other meaning is the 
perception of social phenomena—of 
personalities, roles, institutions, and 
the like. Here there is a great deal of 
work relevant to industrial problems 
—studies of roles and role percep- 
tions in hierarchies, of labor and 
management’s perceptions of one an- 
other, and the definition of leadership 
in terms of the perception of supe- 
riors and subordinates. 

In dealing with the problem of in- 
formation presentation under the 
general heading of human engineer- 
ing, it was necessary to broaden the 
problem somewhat to the problem of 
decision theory, the requirements of 
information in teams, and informa- 
tion under conditions of uncertainty. 
Somewhat the same set of problems 
appear properly under the present 
heading. Edwards (1954) summar- 
ized the work on decision making and 
its relation to economic behavior, and 
the field has made some progress 
since that time. It still seems an un- 
usually fruitful area for research. 
Atkinson (1957), for example, has 
opened the question of motivational 
determinants of risk-taking behavior, 
and Scodal and others (Scodal, 
Ratoosh, & Minas, in press), return- 
ing to the tradition of the personnel 
psychologist, have investigated the 
personality correlates of risk-taking. 
The questions in this area are di- 
rectly related to the typical business 
situation and make it possible to 
bring psychological principles to bear 
most immediately. It has been 
pointed out before that the psycholo- 
gist has traditionally inveighed 
against an oversimplified concept of 
economic man as a model for motiva- 
tion. The risk-taking problem lets us 
test more sophisticated motivational 
notions directly in the very situation 
for which the economic man model 
was invented. Expectations, levels of 
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aspiration, past experience, and self 
concepts all are relevant to the risk- 
taker’s behavior. The amount and 
kind of information is a variable in 
strategic decision matrices. The posi- 
tion of an individual within a group 
bears on immediate relevance to the 
problem. In a sense the restriction of 
output already referred to is part of 
the problem—either in its original 
sense of turning away from economic 
advantage to maximize social need 
satisfactions, illustrating some transi- 
tivity in motivational values in util- 
ity curves, or in its recent sense of a 
more complex bargaining for con- 
tractual advantages in a situation 
more nearly approaching the classic 
theory of games. The combination of 
these problems with, for example, 
those of the experimental sharing of 
risks, and the human engineer’s in- 
terest in information display, would 
seem to indicate a broad and fruitful 
field for the future. 


CONCLUSION 


It is perhaps easiest to set a sum- 
mary statement in terms of a few 
brief points: 

1. The history of the field points to 
the term “psychology in industry,” 
rather than “industrial psychology.” 
Within it three quite separate tradi- 
tions thrive: the experimentalist, 
manipulating an independent varia- 
ble which is usually a physical stim- 
ulus, the differential psychologist, 
and the social psychologist. Each of 
the three carries the threads of gen- 
eral psychological theory, and the 
development of the field is primarily 
the development of psychology, mod- 
ified somewhat by external societal 
pressures, and somewhat less by spe- 
cific demands of industrial problems. 

2. In general, the whole field has 
focused on work, workers, and the 
conditions of work, and, to some ex- 
tent, in spite of the interest in leader- 
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ship, neglected the problems of the 
motivation of management, condi- 
tions of work for leaders, and the like 
have been largely neglected. Recent 
interest in the general problem of or- 
ganization seems to draw attention 
to the over-all problem of the opera- 
tion rather than to the level of pro- 
duction. 

3. Traditional personnel psychol- 
ogy seems to have stalled on the prob- 
lem of selection and classification, 
though iterative solutions to the 
classification problem provide a new 
elegance. The problem of the crite- 
rion continues to be the chief stum- 
bling block, as it has been for 40 
years. Assessment of high level tal- 
ent and research on creativity is 
opening a new route in this area, with 
the possibility of taking advantage of 
the arithmetic of correlation to pro- 
vide considerably greater leverage in 
the selection of men for managerial 
positions. Like the criterion problem, 
the assessment of training continues 
to be much discussed but is dealt 
with only to a limited extent. 

4. Among the areas which look 
promising for the future, that com- 
bining the problems of risk-taking 
and decision theory seems to com- 
bine most of the psychological fields 
and to promise relatively immediate 
yields. Outside of psychology there 
is a good deal of theoretical develop- 
ment to set the problem. Within 
psychology, most of the fields con- 
cerned bear on it. The differential 
psychologist concerns himself with 
personality correlates of risk-taking, 
the engineer with information in de- 
cision making, and the social theorist 
with motivational problems in both 
areas and with the structure of 
groups, for example, in shared risks. 
Relatively few publications have ap- 
peared so far in this area. 

5. A second area which combines 
the various fields is the growing in- 
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terest in organization theory. The 
communications theorist approaches 
it through the problem of networks, 
the human engineer approaches it 
through a similar problem in provid- 
ing information in complex systems, 
and the social psychologist enters the 
problem through the various routes 
of motivation, group structure, and 
roles and status. There is consider- 
able activity in the field, but a sur- 
prising dearth of empirical research. 
Though we have quite a few the- 
oretical statements about how or- 
ganizations grow, we have, surpris- 
ingly, virtually no simple histories of 
how they have in fact grown. Both 
empirical and theoretical work in this 
area seems just around the corner. 
6. Running through a good many 
problems, one development seems 
associated with progress, as it is in 
psychology in general. Where a new 
advance has been made, it has usu- 
ally been with a differentiation of the 
criterion of the original problem, 
which made it possible to bring to 
bear other areas of psychological 
theory. For example, in communica- 
tion, the advance from an early stim- 
ulus bound tradition is marked by 
the work of Hovland and his co- 
workers in opening problems of recall 
and long-term effect on the one hand, 
and conditions of the communicated 
material on the other. Festinger 
opened the question of motivation 
for communication, and the psycho- 
logical process became the topic 
rather than the simple fact of com- 
munication. Again, in the area of 
selection, where the criterion seems 
to block further research, a simple 
lengthening in time of the criterion 
measure might lead the way to 
further insight. Present criteria are 


usually tested shortly after employ- 
ment, and obscure the nature of the 
development of criterion perform- 
Longer measures might focus 


ance. 
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attention to the slope of the curve of 
response to training, to the interac- 
tion of treatment and aptitudes, or 
to the change in factorial content of 
skills with practice. In the general 
area of work and productivity, the 
Coch and French study of resistance 
to change shows the benefit from dif- 
ferentiation of the criterion and the 
provision of theoretical meaningful- 
ness to the new parts. Similarly, in 
the studies of motivation and produc- 
tion, the opening of the area of the 
ecology of motivation by studying 
the work-place both advances the 
understanding and broadens the the- 
oretical base on which it rests. If 
there is one critical area to advance 
in these problems, the progressive dif- 
ferentiation of the would 
seem to be it. 

7. Finally, a group of other prob- 
lems seems to have the support in 
psychological theory for development 
in this field. The problems of roles 
and status in well-organized hier- 
archies, the general area of a shifting 
management philosophy in response 
to changes in the corporation and the 
society, and the problem of conform- 
ity are illustrative. In some cases the 
psychological problem needs to be 
broadened to include the societal one, 
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the contrary basis is too persuasive 
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very success in applying psychologi- 
cal principles to the problems in busi- 
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Few major topics in contemporary 
psychology appear to offer more 
promise than rigidity, and the 
amount of work reported on this sub- 


ject has been increasing year by year. 
Unfortunately it is also the case that 
few areas present such a quagmire to 
the unwary investigator; rigidity is 
not a simple concept, and the subdi- 
visions within it are far from clear. 
The aim of the present paper is two- 


fold: to summarize and assess the re- 
sults of recent explorations, and to 
suggest one or two paths which might 
be followed further. 

“Rigidity” has proved a difficult 
term to define acceptably for it has 
been used to describe behaviors char- 
acterized by the inability to change 
habits, sets, attitudes, and discrim- 
inations. It has grown out of related 
topics, such as perseveration and the 
analysis of personality traits. The 
term perseveration was first used by 
Neisser in 1894, and described by 
Spearman in 1927 in 
mental law of inertia: ‘Cognitive 
processes always both begin and 
cease more gradually than their ap- 
parent The experimental 
work on perseveration was largely 
concentrated on the motor and 
sensory processes, though emotional 
and ideational aspects were also con- 
sidered by some people. The next 
step forward came in 1935 when Cat- 
tell (1935a) made the distinction be- 
tween “the inertia of mental proc- 
esses,’ found when a person is asked 
to alternate between two previously 
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causes. 
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practiced motor skills, and ‘‘disposi- 
tion rigidity,”’ which operates when a 
familiar task has to be performed in 
some new way. The latter obviously 
bears a relationship to retroactive 
inhibition and to habit interference, 
and it is only this aspect of persevera- 
tion which Cattell has studied fur- 
ther. He found that disposition rigid- 
ity was related to certain personality 
factors—those of submissiveness and 
lack of character integration. 

Similarly, the history of work on 
the rigid personality can be traced 
back to William James, who divided 
men into tough and tender minded, 
and to Thurstone who found a factor 
of radicalism-conservatism. Eysenck 
(1944) combined these two in the 
same matrix and found that they 
were orthogonal to each 
Frenkel-Brunswik (1949a) sug- 
gested that the prejudiced person 
(i.e. tough and conservative) also had 
a rigid personality. Obviously, this 
“rigidity”? need not be the same as 
that studied by Cattell, but the use 
of the same word has highlighted the 
search for a link between flexibility of 
personality and flexibility of change 
in ideas and habits. 

In the light of the history of the 
term, one of the best definitions 
seems to be that given by Cattell 
(Cattell & Tiner, 1949) when he de- 
scribed disposition rigidity as the dif- 
ficulty with which old established 
habits may be changed in the pres- 
ence of new demands. Examples of 
other definitions are those by Ro- 
keach (1948): ‘‘the inability to change 
one’s set when the objective condi- 
tions demand it,’’ Buss (1952): “‘re- 
sistance to shifting from old to new 
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discriminations,’’ Goldstein (1943): 
“tadherence to a present performance 
in an inadequate way,” and Werner 
(1946a): “lack of variability of re- 
sponse.”” These definitions agree in 
general outline but vary in detail to 
an extent sufficient to cause confu- 
sion. There is no agreement over the 
meaning of ‘ta demand to change,” 
“an inadequate performance,” “‘re- 
sistance to shifting a set,’’ and “‘ina- 
bility to change.”’ For example, the 
change of method demanded in some 
tests has been one which is not essen- 
tial to success in the task; it may 
even cause the time spent on the task 
to increase. 


Explanations of Rigidity 


Three explanatory approaches to 
the study of rigid behavior will now 
be described; these are Cattell’s 
work, Goldstein's study of the brain- 
injured, and the Lewinian theory. 

Cattell (Cattell & Tiner, 1949) dis- 
criminated between two types of be- 
havior which had both previously 
been called ‘“perseveration.”” The 
first is “‘process rigidity” or the tend- 
ency for a former response to continue 
although a new stimulus has been 
substituted for the old one. Another 
name for process rigidity is ‘‘mental 
inertia,’’ and it is best seen in alterna- 
tion tasks. (Temporal contiguity is 
essential if it is to occur at all.) He 
calls the second type “structural 
rigidity.”” This is the resistance of a 
habit or personality trait to forces 
which might be expected to change 
it. The habit remains unchanged de- 
spite the fact that a more ‘“reward- 
ing’’ response to the new stimulus 
could be made. Temporal contiguity 
of the two activities need not occur. 
It is difficult to distinguish com- 
pletely between process rigidity and 
structural rigidity, though each can 
be clearly described by examples. 
Walker, Staines, and Kenna (1943) 


SHEILA M. CHOWN 


showed that process rigidity could ac- 
tually be explained in terms of struc- 
tural rigidity. 

Cattell suggested that there might 
be three main causes of structural 
rigidity. In the first place, low g 
and also low “fluency of random as- 
sociation”’ might prevent a person 
from seeing that a new response is 
necessary or prevent his realizing 
what the new response should be. 
Defective strength of motivation or 
conflicting motives might also cause 
structural rigidity. Cattell maintains 
that there is a third source of rigidity, 
which lies in “‘a resistance to change 
of neural discharge paths,’’ which 
will be ‘‘a basic attribute of all dis- 
positions,’’ and which cannot be al- 
tered by applying rewards or punish- 
ment. He equates this with “disposi- 
tion rigidity’’ which he considers to 
be the most interesting source of 


rigid behavior. However, in his 
opinion, all research on_ rigidity 


should control the three main causes 
of structural rigidity and relate any 
new discoveries to them. The neces- 
sity for studying all causes of a par- 
ticular piece of rigid behavior is very 
obvious in any field investigation. 

It seems likely that both process 
rigidity and structural rigidity will 
enter into learning difficulties. Men- 
tal inertia will probably be of small 
importance compared with the ef- 
fects of having to modify a habit of 
long standing. This paper will not 
deal with process rigidity, since a dis- 
cussion of this may be found in 
Eysenck (1953). 

Many investigators have been in- 
terested in the rigid behavior which is 
a symptom of certain abnormal con- 
ditions. Goldstein (1943) deals with 
the two types of rigidity resulting 
from organic brain damage. ‘‘Pri- 
mary rigidity” is the inability of a 
patient to change from one train of 
thought to another. The patient pays 











no attention at all to stimuli which 
are unrelated to the matter in hand. 
“Secondary rigidity’”” is displayed 
when a person is faced with a problem 
which is too difficult for him; he pre- 
fers an incorrect answer to making no 
response at all. It occurs particularly 
when patients are asked to deal with 
abstract things, which they can no 
longer understand. The main cause 
of rigidity in these patients would 
appear to be lack of g. However, 
Goldstein points out that lack of g 
due to brain injury may also cause 
“distractability’”” when the patient 
will light in turn on a number of 
small points rather than face the 
main problem. 

Werner (1946b) has shown that de- 
fectives display a different type of 
rigidity from those with brain inju- 
ries. The former fail to solve problems 
because they oversimplify them. The 
latter seize on resemblances to prob- 
lems which they have previously 
faced, and try to use methods which 
are no longer applicable. For Werner, 
rigidity is a functional rather than a 
structural concept, and multiform 
rather than unitary. He holds that 
in many cases differences between 
investigators stem from their differ- 
ent interpretations of the 
“rigidity.” 

The Lewinian theory links the ex- 
istence of rigidity to the presence of 
strong boundaries between mental 
functions. As various boundaries 
may differ in strength in one indi- 
vidual no relationship can be ex- 
pected between one test of rigidity 
and another which involves a differ- 
ent function. The results explained 
in this way can, however, be ex- 
plained in other ways as Goldstein 
(1943) has already pointed out. Most 
experimenters have avoided the 
Lewinian use of the term, preferring 
to think of rigidity as a description of 
behavior for which further explana- 
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tion is necessary, rather than as an 

explanation in itself. ‘‘Permeability” 

would be a better name for the prop- 

erty of the “mental boundaries” in- 

volved in the Lewinian explanation. 
TEsTs 

The experimental work on rigidity 
has been carried out with the aid of 
a number of tests, each of which can 
be said to measure ‘“‘rigidity’’ in its 
own right. But the relationships be- 
tween these tests are not always 
known, and where they have been in- 
vestigated, it seems as though more 
than one type of “rigidity” is in- 
volved. 

A brief description of the main 
tests will be given here; their rela- 
tionships with each other will be de- 
scribed more fully in the next section. 


Einstellung Tests 


Einstellung tests involve building 
up a “‘set’”’ in the S and then giving 
him a problem which is best solved 
in some way other than the one he is 
expecting. 

The best known Einstellung test 
was devised by Luchins (1951la, 
1951b). The original version con- 
sisted of 10 problems in which the Ss 
had to discover a method of obtain- 
ing a certain quantity of water if 
given three containers of specified 
size. The first five problems could be 
solved in one way only and were 
meant to establish a set. The next 
two could be solved either in the set 
way, or by a shorter method. The 
eighth could be solved only by the 
second method and the last two by 
either method. The S's methods of 
solving the sixth and seventh prob- 
lems are said to give a measure of his 
susceptibility to set, and the last 
three problems show how well he is 
able to overcome the set. Modifica- 
tions of the test have included the 
introduction of an initial problem 
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which can be solved by the “long”’ 
or “short”? method, alteration in the 
number of “set’’ problems given, 
omission of the problem which can 
only be solved by the short method, 
and using all types of ‘“‘error’’ to form 
one rigidity score. 

Cowen, Wiener, and Hess (1953) 
developed a test of Einstellung rigid- 
ity using the same principles as the 
water jar test but involving a differ- 
ent type of skill. Each of the prob- 
lems of their alphabet maze test is a 
six-by-six letter square. The S has to 
move one box at a time from the up- 
per right hand corner of each square 
to the lower left hand corner spelling 
out words on the way. He may move 
in any direction as long as the move 
helps to spell out a word. In case 
more than one path is available, the 
correct solution is the one that uses 
the fewest number of boxes. 

Another form of Einstellung test 
was that of Rees and Israel (1935) 
who used anagrams. The scrambling 
order is the same for all the problems 
of the first series. Then anagrams are 
given which can be solved by using 
either the same order or another one. 
An alternative form of anagran:3 test 
first uses words which are all associ- 
ated with one topic, e.g. food or na- 
ture, and then gives anagrams having 
an alternative solution which is not 
connected with this topic. Experi- 
menters have usually had an intro- 
ductory series of 20 to 25 anagrams, 
and a similar number of ‘“critical’”’ 
problems. 

The water jar test of rigidity has 
received its most severe criticism 
from Luchins himself (1951b). Levitt 
(1956) in his recent review of the 
literature concerning  Einstellung 
tests also covers many of the seven 
points described below. Firstly, users 
of the test assume that Einstellung 
rigidity can be equated with rigidity 
as displayed in behavior in the world. 
The criticism is true of all tests of 
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rigidity, since none of them have 
been measured against actual behav- 
ior. Secondly, a certain amount of 
ability at arithmetic is needed to do 
the tests at all; most experimenters 
have had to exclude a number of Ss 
who could not subtract correctly. 
Another point is that some people use 
the long method of solution even be- 
fore they have been “trained” by the 
set problems. In many experiments 
these people too have been excluded. 
The fourth—and greatest—fault is 
that problems with alternative solu- 
tions do not require the Ss to change 
the method they have been using all 


along. For these “trained’’ Ss, the 
“set”? method may well be quicker 
than the new shorter method. A 


fifth serious criticism is that the Ss 
may have other reasons than rigidity 
of thought for their choice of the 
‘set’? method when a simpler alterna- 
tive is possible. These reasons in- 
clude thinking that they are meant to 
use the same method each time, and 
thinking that all three jars must be 
used to do each problem. Sixthly, 
the emotional impact of the words 
“test” and ‘‘quickly” in the instruc- 
tions may cause the Ss to use meth- 
ods which they would not otherwise 
employ. Lastly, in scoring the test, 
it seems apparent that susceptibility 
to set and ability to overcome it are 
different measures and should be 
kept separate. 

The alphabet maze test is open to 
many of the same objections as the 
water jar test. Moreover in those 
problems which include two paths 
the transition probabilities between 
letters should have been made equal 
at the point of choice. For example 
the correct solution in one problem is 
“ther hat” and the incorrect one “her 
red suit.”’ After h,e,r, another r is 


more probable than an h, and the S’s 
choice may be biased even if he is not 
aware of it. 

Tests employing anagrams have 
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the disadvantage that Ss might find 
the “‘nonrigid’”’ solution, and go on 
looking for the rigid one. Unlike the 
arithmetic problems, there is no 
“‘better”’ or “‘best”’ solution to an ana- 
gram that can be solved in two or 
more ways. 

Though attempts to develop other 
Einstellung tests have been made, 
these three seem the most hopeful. 
It seems that a better measure of our 
chosen definition of rigidity would be 
obtained by cutting out ‘“‘suscepti- 
bility to set’’ and dealing only with 
“the ability to overcome set.’’ The 
precautions suggested previously 
should be taken when any of these 
tests are to be used and there are 
other improvements which could be 
made. For instance, a more satisfac- 
tory scoring system than ‘“‘number of 
non-Einstellung problems correctly 
solved’ might be achieved. One pos- 
sible measure would be the time 
taken to solve each of these problems, 
compared with the time taken to 
solve the Einstellung problems. Such 
a measure would enable a number of 
Einstellung series to be given to one 
individual and ranged in order of the 
difficulty of the change required to 
solve the non-set problems. In this 
way, it would be possible to develop 
a rigidity scale. 


Concept Formation Tests 


Several concept formation tasks 
have been used to study intellectual 
rigidity. Examples are the Wisconsin 
card test, the Weigl card sorting test, 
the New York University card sort- 
ing test, and Buss’s wooden blocks 
test which he took over from the 
Vigotsky test. The Wisconsin card 
test will be described more fully. It 
consists of four stimulus cards and 64 
response cards. The stimulus cards 
contain one red triangle, two green 
stars, three yellow crosses or four blue 
circles. Each response card has on it 
one of four shapes in one of four col- 
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ors, and the figures appear from one 
to four times on the card. Thus it is 
possible to sort the cards according to 
shape, color or number under appro- 
priate stimulus cards. 

In all the tests, the general pro- 
cedure has been for the investigator 
to decide arbitrarily which variable is 
to be the basis for grouping and then 
to ask the Ss to discover it. Once the 
S has responded correctly a certain 
number of times, the critical variable 
is changed without informing him. In 
this way, he is forced to form a new 
concept or else to continue to fail by 
sticking to the one which was previ- 
ously successful. It has been found 
that some Ss never realize what is 
happening, while others have such 
clear insight that they can later de- 
scribe the experimenter’s plan of 
campaign. One difficulty in evaluat- 
ing the test is that a person who once 
realizes that the experimenter has 
changed the criterion has a good 
chance of recognizing further changes. 
In fact, he will become ‘‘set’’ to look 
out for changes. 

Concept formation plays an im- 
portant part in everyday life, and 
tests which utilize this function seem 
worth pursuing. Again, concepts of 
different levels of difficulty need to 
be studied and work is needed which 
will compare a person’s performance 
when he knows changes will occur 
with his performance when he is left 
to discover the changes for himself. 
Personality Tests 

Only one paper and pencil test of 
personality rigidity has been pro- 
duced so far. Wesley (1953) chose 
the items for this empirically, and 
then had them rated for degree of 
rigidity by five psychologists. The 
final test consisted of 50 items which 
all the judges rated as “high.’”’ Zelen 


and Levitt (1954) maintain that a 
number of the items are duplicated 
and that the same reliability and 
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validity was obtainable from fewer 
items. They chose 12 which looked 
to them to be representative of the 
whole test and claimed that the 
shorter version correlated highly with 
the original version (r=.73). They 
also claimed that the reliability was 
at least as high as that of the original 
version (r= j74 for both). The short- 
ened form correlated to the same ex- 
tent (.38) as the longer one (.26) with 
the California Ethnocentrism Scale 
—the only comparison of validity 
made. The shortened form contains 
one item which could hardly be ap- 
plicable in all countries, i.e. “‘I never 
miss going to church,” but the other 
items appear to be “‘culture free.” 

This test in both its forms is of 
course open to all the criticisms gen- 
erally levelled at paper and pencil 
inventories. Also no item analysis 
or analyses of the correlations be- 
tween items is reported, so that 
there is no evidence that the test is 
unidimensional. That Zelen and 
Levitt were able to choose a certain 
number to “represent’’ the whole 
test suggests that there may be 
clusters of similar items in the orig- 
inal version. There is, of course, no 
guarantee that the test would cor- 
relate with actual behavior, even 
though it may give a picture of the 
way individuals regard their own be- 
havior; this could be overcome to 
some extent by a comparison with 
friends’ ratings, since most of the 
items refer to instances of behavior 
rather than feelings. The test ur- 
gently requires factor analyzing and 
checking against a criterion for valid- 
ity, and the results so far achieved 
with it suggest that these tasks would 
repay the effort involved in carrying 
them out. 


Other Tests of Rigidity 


One test which has been chosen 
fairly frequently to study rigidity isa 





SHEILA M. CHOWN 


perceptual task, the hidden objects 
test. The S is asked to find as many 
hidden objects as possible in pictures 
similar to those often found in chil- 
dren’s magazines. Numerous hidden 
objects are present and it is supposed 
that rigid Ss will have more difficulty 
in seeing them and so will produce a 
shorter list. Cattell (1946) found 
that this test had the highest loading 
of any on his factor of “disposition 
rigidity.”” This type of test does not 
always correlate highly with the su- 
perficially similar “‘Gottschaldt figures 
test’’ where certain outlines are dis- 
guised in complicated figures and the 
S is asked to pick out the simple out- 
lines. 

Becker (1954) developed the novel 
idea of using aniseikonic lenses, ini- 
tially designed to study distortion in 
stereoscopic perception, to study 
rigidity. The effect of the lenses is to 
“‘make a table appear to tip up like a 
drawing board or to make a wall lean 
towards the observer.”” Becker no- 
ticed that people varied in the time it 
took them to see the distortion and 
in the degree to which they experi- 
enced it. He suggested that more 
“rigid” individuals who manipulated 
the world to conform to their own 
preconceptions, would report less dis- 
tortion and take longer to see it. . 
Learning effects did not occur after 
the first two trials, and both time in- 
terval and degree of distortion ap- 
peared to remain constant from situa- 
tion to situation. Subjects were 
asked to call out as soon as the dis- 
tortion occurred and to estimate how 
much higher the back of a plane sur- 
face appeared to be than the front. 

The method of obtaining the re- 
sults in this test could be improved; 
the Ss could be asked to estimate the 
distortion by moving a board to rep- 
resent the distorted table top instead 
of giving a rough estimate. The ef- 
fects of expecting the distortion have 
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not been studied—just how much 
‘learning’ occurred during the first 
two trials, and why. The rationale 
behind the test is not clear; it can 
easily be argued that rigid people 
should see the distortion more quickly 
instead of less on the grounds that 
they should be less able to manipu- 
late their perception when this is de- 
manded of them. More work is 
needed on the test, but it is one in 
which there are possibilities, since it 
offers opportunity for objective meas- 
urement and comparison of one indi- 
vidual with another. 

Originally Becker noticed that 
children were more susceptible than 
adults to the distortion, and it would 
be interesting to discover whether 
further age differences exist among 
adults. It would also be interesting 
to discover whether this type of per- 
ceptual rigidity is related to set, as 
measured by Einstellung tests, and 
whether ease of change of other hab- 
its, not necessarily perceptual, is re- 
lated to ease of change of perception. 

The California Ethnocentrism 
Scale (Adorno, Frenkel-Brunswik, 
Levinson, & Sanford, 1949) 
shown some slight relationship to 
other measures of rigidity and some 
experimenters have used racial prej- 
udice as a rigidity test in its own 
right. However, though rigidity and 
ethnocentrism may well be related, 
there is no evidence that they are 
synonymous and assumptions of this 
nature must be viewed with suspi- 
cion. The California F scale has also 
been used, in this case the assumption 
being that authoritarianism and ri- 
gidity are closely related. 

The Rorschach test has also been 
used to get a measure of personality 
rigidity. In some cases the protocols 
have been rated by expert judges 
looking for previously decided signs 
of rigidity; in other cases some of the 
actual scores have been taken as pos- 
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sible measures. Total production, for 
example, or the ratio of “part pic- 
tures’’ to ‘‘wholes’’ are two of the 
measures that have been utilized. 
This test seems most suitable as a 
measure of ‘‘creative’’ rigidity; the 
blots impose no limits, and hence 
things such as imagination are bound 
to enter into the results. 


RELATION BETWEEN TESTS 


It is uncertain whether there is any 
generalized trait of rigidity. As Cron- 
bach (1956) puts it, ‘in those studies 
which reject a general factor the re- 
liability of the measures is uncertain. 
In those which find positive relations, 
test-taking attitudes may provide an 
adequate explanation.”” Much ap- 
pears to depend upon the tests of ri- 
gidity used, the type and number of 
tasks involved, and the conditions 
under which testing was carried out. 

In describing the known relation- 
ships between tests, difficulties of 
classification have arisen. Few in- 
vestigations overlap; each experi- 
menter has tended to use a different 
combination of tests, including some 
unique ones. Sometimes, however, 
more than one of the well known 
tests already described have been in- 
cluded. In order to avoid mentioning 
the same investigation more than 
once, the order in which the tests 
were described in the previous sec- 
tion has been used as “‘an order of 
precedence.’ Investigations employ- 
ing Einstellung tests are, therefore, 
described first, then those using con- 
cept formation tasks and so on. 
Under each heading, investigations 
employing more than one major test 
are described before those using one 
major test and several other tests. 

A summary of the relationships is 
given at the end of the section. 


Einstellung Tests 


The water jar test was found by 
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Cowen, Wiener, and Hess (1953) to 
be related to the alphabet maze “‘set”’ 
test (r=.42). They controlled varia- 
bles of age and intelligence and pre- 
sented the two tests in an exactly 
parallel fashion to 59 college under- 
graduates. Neither of the test scores 
was distributed normally; both 
curves were U-shaped. It would have 
been surprising if, given a group with 
the ability to do the type of arith- 
metic and spelling problems involved 
in the two tests, the rigidity scores 
on the tests had not been related. 
Goldstein and Sheerer (1941) used 
the Luchins test, an anagrams test 
(adapted from Rees & Israel, 1935), 
the Shipley-Hartford Retreat Scale, 
and Thurstone attitude scales to- 
wards the Bible, Censorship, Patri- 
otism, and the Law, in order to look 
for social attitudes which could be re- 


lated to Einstellung rigidity. Sub- 
jects were 150 undergraduates. No 
significant intercorrelations were 


found between the two Einstellung 
tests, nor were there any consistent 
relationships between these tests, the 
Shipley-Hartford Retreat Scale and 
the attitude scales. It is surprising 
that the correlation between water 
jar and anagrams (the highest be- 
tween any of the tests) only reached 
r=.17, which is not significant at the 
.05 level of probability. The experi- 
menters themselves point out that 
their Ss may have supposed the ex- 
periments were intended to test 
speed or ability to use all the jars. It 
must also be remembered that the Ss 
were not volunteers and that two ses- 
sions at an interval of a fortnight 
were used to administer the tests. 
Rokeach (1948) endeavored to find 
a factor of rigidity in ethnocentric 
people. He used the Luchins water 
jar test and the California Ethno- 
centrism Scale. He divided his Ss 
into high and low prejudice groups 
according to their scores on the Cali- 
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fornia Ethnocentrism Scale. He 
called those who scored above the 
mean of his group the highly preju- 
diced and those who scored below it 
the low prejudice group. His 70 Ss 
were white American-born under- 
graduates. He found that the high 
prejudice group was more rigid, and 
more liable to think concretely. Us- 
ing a map test (rather like a maze 
test), he confirmed the result ob- 
tained with the water jar test. His 
work has been criticized by Luchins 
(1949) on the grounds of the subject 
matter of the Ethnocentrism test, the 
dichotomized results, and the purely 
verbal nature of the test. He also 
questioned the statement that eth- 
nocentrism implies rigidity and 
pointed out that the eghnocentric 
person may after all be open to 
change. In his view, neither the use 
of scratch paper nor increased ver- 
balization provide’ evidence for 
greater concreteness of thought: Ro- 
keach (1949) answered these criti- 
cisms by admitting that “ethno- 
centrism implies rigidity” is a prem- 
ise, and that social and emotional 
implications existed in the test ma- 
terial. He maintained that the in- 
structions were formulated so as to 
make the Ss watch for ‘“‘short’’ meth- 
ods. He agreed that verbalization is 
not a satisfactory method for meas- 
ing rigidity, but said that the use of 
scratch paper is. The dichotomy of 
the prejudice scores obtained from a 
very selected group might well allow 
a spurious correlation to be found be- 
tween it and a test of rigidity which 
as we already know, gives a U-shaped 
distribution of results. 

Applezweig (1954) carried out a 
fuller investigation of the relation- 
ship between the water jar test and 
other tests of rigidity. These were the 
Rorschach, the Angyal perception 
test, Luchin’s hidden words test, a 
hidden objects test, and the Cali- 
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fornia Ethnocentrism Scale. Sub- 
jects were 79 candidates for a U. S. 
Navy submarine school. Since Ap- 
plezweig was also interested in the ef- 
fects of anxiety on rigidity, the Ss 
were divided into three groups. One 
was tested the day before, one the 
day after, and one a week after the 
candidates’ tests for entrance to the 
school. Her own words seem worth 
quoting here: “Of the 45 correla- 
tions among the six rigidity measures, 
22 are found to be negative, 21 are 
positive, and 2 are zero. Further- 
more, only three of these 45 correla- 
tions reach the .05 level of signifi- 
cance, and two of those three turn out 
to be negative correlations. There is 
with rare exception, no consistency 
between any two measures under all 
three conditions.””’ Applezweig sug- 
gests that the measures may not be 
reliable or they may be measuring 
different things. In fact, she goes on 
to explain the way in which she 
thinks that performance on the differ- 
ent tests may have been affected by 
the amount of stress in the situation. 
One test would appear more threat- 
ening in one situation than another. 
Hence the person would appear more 
rigid on it in the first situation. Of 
course, such arguments after the 
event are difficult to evaluate, unless 
they fit in with other findings. Ap- 
plezweig rightly concludes that she 
has ‘‘no evidence of generalized ri- 
gidity on six tests claimed to measure 
this trait,’’ but that she has some evi- 
dence that feelings of insecurity or 
anxiety will affect performance on 
tests of rigidity. Her work may be 
criticized on the grounds that not all 
types of rigidity test were included 
in her six. The results are more in- 
teresting, however, in that they con- 
tradict the findings of Rokeach and 
those of Cowen and Thompson now 
to be described. 

Cowen and Thompson (1951) tried 
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a direct attack on personality varia- 
bles and rigidity. They gave the 
water jar test, the Bell and California 
personality inventories and the Ror- 
schach to 93 children in the eighth 
grade. On the basis of their scores 
on the water jar test, a rigid and a 
flexible group were distinguished. 
Those Ss who made mathematical er- 
rors and those who gave complex so- 
lutions to an initial control problem 
were eliminated, so that only 34 Ss 
were actually used. Of these, 17 were 
“flexible” and 17 were “‘rigid.”’ It was 
found that neither of the personality 
inventories were related to rigidity- 
flexibility. -The Rorschach showed 
up certain differences between the 
groups in that the rigid group pro- 
duced fewer responses, were less able 
to organize and integrate the blots, 
and gave fewer “color responses.” 
The protocols were subjected to the 
ratings of judges for ‘‘over-all rigid- 
ity’’ and these were significantly re- 
lated to the rigid and flexible groups. 
In this investigation the Ss were a 
highly selected group of children who 
were good at arithmetic. It would 
not be safe to generalize on the basis 
of these results as Applezweig’s work 
in 1955 has shown. In fact, the re- 
sults of Rokeach, Applezweig, and 
Cowen and Thompson leave the rela- 
tionship between Einstellung tests, 
the ethnocentrism test and Rorschach 
test uncertain. 

Harway (1955) who worked on 
goal-setting behavior as an aspect of 
rigidity, suggested that rigid and 
nonrigid groups would behave differ- 
ently in a level of aspiration test. 
This followed from his acceptance of 
Goldstein’s concept of secondary 
rigidity. He thought that the rigid 


group might obtain either higher or 
lower scores on measures of varia- 
bility of goal-setting. The water jar 
test was used to measure rigidity, and 
the Rotter aspiration board, another 
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water jar task and a hidden word task 
were given as measures of level of as- 
piration. A rigid and_ nonrigid 
group were formed from the sixty-six 
subjects who successfully completed 
the water jar test. His results showed 
that “rigidity in problem solving 
seems to be related to greater varia- 
bility of goal-setting behavior’ but 
there was a big overlap between the 
scores of the two groups. 

Levine (1955) tried to find a rela- 
tion between the water jar test scores 
of 100 male veterans and their time 
scores on a simple discrimination 
task, in which they had to say which 
of two figures was the larger. In 12 
out of the 22 cases, the figures were 
the same size. He found no relation- 
ship between the two scores, but as 
he points out it is only an assumption 
that a simple discrimination will 
measure impulsiveness or caution. It 
is also not proven that the water jar 
test measures “‘rigidity’’ in this 
sense. 

Guetzkow (1951) found no rela- 
tionship between water jar perform- 
ance and ability to solve the Maier 
pendulum problem by as many dif- 
ferent methods as possible. This was 
a good illustration of the fact that the 
ability to solve a given mathematical 
problem may be different from that 
required to seek out alternative solu- 
tions to a problem. In the one case, 
the S knows the end towards which 
he is working; in the other, the num- 
ber of possible solutions is not fixed. 
Guetzkow also showed that “‘suscep- 
tibility to set’’ and “ability to over- 
come set’’ are two different things; 
that men and women are equally sus- 
ceptible to set, but that men appear 
to be better able to overcome it. 

Wolpert (1955) used five rigidity 
tests which were all supposed to be 
reliable, and sought for a general 
factor among them, They were the 
water jar test, the applicability of ad- 
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jectives to self and friends, the appli- 
cability of adjectives to various social 
situations, narrowness of attitude to 
the world, and the stability of repro- 
duction of lines. The tests were given 
to 38 volunteer students, and com- 
parisons were carried out between 
their rank order rigidity scores. No 
relationship between the tests proved 
significant, and Wolpert therefore 
concluded that there is no generalized 
factor of rigidity. His choice of tests 
may be questioned: stereotypy of 
description may not indicate rigidity 
of character or of intellect. His scor- 
ing of the water jar problem is also 
open to criticism. Does “the number 
of old method solutions to those prob- 
lems solvable by a shorter method 
plus the number of failures to solve 
those problems where the old method 
is completely inappropriate,” really 
constitute a satisfactory measure of 
rigidity? Other workers consider 
with good reason that ‘“‘susceptibility 
to set’’ and “‘ability to overcome set”’ 
are two different things: It looks as 
though Wolpert’s conclusion that 
“rigidity is not a generalized factor,” 
takes too much for granted. 

Oliver and Ferguson (1951) car- 
ried out a factor analysis of a number 
of so-called rigidity tests, from which 
they found three factors. One of 
these was “habit interference or rigid- 
ity.” The tests which were loaded 
highly on this factor were substitu- 
tion of the normal meaning of arith- 
metic signs, giving the letter so many 
behind a given one in the alphabet, 
giving the opposites of the seasons or 
not, according to the printing, figure 
analogies and a same-opposites test. 
Those not included in the factor were 
rewriting mirrored words in the nor- 
mal way, anagrams, a number series, 
the Gottschaldt figures test and the 
Luchins water jar test. The tests in- 
cluded in the factor all need the re- 
versal of a previously over-learned 
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habit, and, therefore, it seems possi- 
ble that the factor is one of ‘‘disposi- 
tion rigidity.”’ It is of interest that 
at least two common Einstellung 
tests of rigidity, anagramis and the 
Luchins water jar, are not included in 
the factor. 

This is, perhaps, the most useful of 
all the investigations employing the 
water jar test, since it has suggested 
that Einstellung tests may differ in 
kind from other tests of rigidity. It 
is the more unfortunate that the bat- 
tery did not include a test known to 
be high in g and some of the more 
commonly accepted ‘‘disposition ri- 
gidity”’ tests. 

Pitcher and Stacey (1954) used the 
Guilford-Zimmerman temperament 
survey, and an Einstellung test of 
similarities between words, in, their 
attempt to see whether rigidity could 
be described as a general personality 
trait. The similarities-between-words 
test was so designed that the first 
seven items should have elicited con- 
crete or functional similarities, the 
next two, abstract similarities, while 
the last four could be answered in 
either way. The 373 Ss were divided 
into seven groups, ranging from those 
who gave the most rigid—that is the 
greatest number of functional sim- 
ilarities—to least rigid or most ab- 
stract. The scores on the 10 traits of 
the Guilford-Zimmerman scale were 
compared by analysis of variance to 
see whether there were any differ- 
ences related to rigidity. Two sig- 
nificant differences were found, on 
Ascendance and Masculinity. The 
latter varied according to the respec- 
tive numbers of male and female Ss 
within the seven groups. The main 
contributions to the significance of 
Ascendance came from the least 
rigid, who scored more highly on this 
trait. Pitcher and Stacey point out 
that this agrees with Cattell’s find- 
ings that the more rigid are the more 
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submissive. However, one should not 
generalize from disposition rigidity 
tests to Einstellung tests in this way, 
without further direct evidence that 
the two are related, particularly in 
view of Oliver and Ferguson’s work 
in 1951. The results were obtained 
from one test which had not been 
studied in relation to any other tests. 
The results ‘“‘do not support the hy- 
pothesis of a generalized rigidity 
factor,”’ but neither do they refute 
this hypothesis. 

French (1955) gave the water jar 
test, California F scale, a changing 
figures test, preferences for design 
and two closure tests to 50 airmen 
under stress and 50 under nonstress 
conditions. She concluded that ego- 
involving conditions did not produce 
an increase in rigid behavior and 
that there was no evidence of gen- 
eralized rigidity among the tests. 
Only the two closure tests were sig- 
nificantly related under both stress 
and nonstress conditions. 


Concept Formation Tests 


Wesley (1953) gave a card-sorting 
task requiring change of concepts 
without warning to Ss chosen accord- 
ing to their scores on the Wesley ri- 


gidity inventory and the Taylor 
Manifest anxiety scale. There were 
21 rigid .Ss, 21 anxious Ss, and 30 nor- 
mal Ss. All took equally long to learn 
the initial concept, but the rigid 
group took significantly longer to 
shift its set (P=.04). The rigid 
group also made significantly more 
perseverative responses than the 
anxious group (P =.02), and the nor- 
mal group (P=.05). The rigidity 
scores were, by the nature of the 
sampling, not related to scores on the 
Taylor Manifest Anxiety scale, and 
the scores on the latter did not seem 
to show any clear relationship to the 
sorting test. From these results it 
seems that it is possible to find rigid 
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people who are not anxious and anxi- 
ous people who are not rigid, though 
Gaier (1952) found that the two went 
together, at any rate at the time of 
testing. 


Wesley Scale 


The Wesley inventory was also 
used by Zelen and Levitt (1953) in an 
attempt to find out whether their 
group level of aspiration test could 
be regarded as a measure of rigidity. 
The other tests were the California 
Ethnocentrism Scale, and the Rotter 
board. All three measures proved to 
be significantly related to their test. 

Jones (1954) measured the scores 
on the California F Scale against the 
rate of fluctuation in the Necker cube 
illusion, the Guilford-Zimmerman 
Temperament Survey, and the Wes- 
ley Manifest Rigidity Scale. There 
was a significant negative correlation 
between the California F Scale and 
the rate of fluctuation of the Necker 
cube; a significant positive correla- 
tion between the F scale and two 
anxiety and two hostility scales on 
the Guilford-Zimmerman _ survey; 
and a significant positive correlation 
between the F scale and the Wesley 
Manifest Rigidity Scale. 


Hidden Objects 


Pullen and Stagner (1953) adopted 
Cattell’s definition of rigidity and 
used four of the tests suggested by 
him in order to measure it. These 
were a motor test of creative effort, a 
hidden objects test, riddles and 
flicker fusion. Hidden words were 
also used, and new tests incorporated. 
These were naming colors finely 
graded from one shade to the next, 
naming changing figures, and naming 
changing pictures. The Wechsler- 
Bellevue was included in the battery 
as a reference test. The rigidity score 
for the last three tests mentioned was 
the number of instances in which the 
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Ss gave conflicting opinions when the 
colors, figures, or picture series were 
started at different ends of the con- 
tinuum. Subjects were 60 psychotics, 
35 of whom received convulsive shock 
therapy. Two factors—“rigidity” 
and “‘intelligence’’—were found on 
centroid analysis. The changing 
colors, figures and creative effort sub- 
tests were the most heavily loaded on 
the rigidity factor while hidden ob- 
jects, hidden words, riddles, and lack 
of flicker fusion overlap contributed 
mainly to the intelligence factor. 
After shock therapy, the rigidity 
scores decreased. This rigidity factor 
seems likely to be the same one that 
Cattell called “disposition rigidity,” 
although not all Cattell’s tests were 
loaded on their rigidity factor. This 
is, in fact, one of the more disturbing 
factor analytic studies because of the 
high negative loading of four “‘rigid- 
ity”’ tests on intelligence. 

Cattell (1946) himself found that 
writing tests, in which the Ss had first 
to do a familiar task and then to alter 
it so that it became unfamiliar, were 
heavily loaded on the disposition 
rigidity factor. When he gave the 
same 100 Ss his 16-personality factor 
test, he found that this type of rigid- 
ity was related to Submissiveness, 
Character Integration, and, to a 
lesser extent, Stability and Surgency. 


Anisetkonic Lenses 


Becker (1954) using aniseikonic 
lenses, found that time delay in seeing 
the distortion and the degree of dis- 
tortion experienced were related to 
certain Rorschach measures, among 
them the weighted scale developed 
by Fisher (1950). 

Martin (1954) also experimented 
with aniseikonic lenses and found 
that those quick to perceive distor- 
tion asked few questions in an am- 
biguous interview situation. This 
suggests that asking few questions 
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may be a measure of flexibility and 
ease of adjustment rather than of 
rigidity. 


Ethnocentrism 


Using the California Ethnocen- 
trism Scale and an unpublished Toler- 
ance of Ambiguity Scale consisting of 
questions about change and _per- 
severance, which was devised by 
Walk, O’Connor (1952) found that a 
complex relationship existed between 
these two and ability to carry out 
syllogistic reasoning. Intolerance of 
ambiguity was related to poor ab- 
stract reasoning only when it was 
associated with ethnocentrism. 

Rokeach (1951) found that ab- 
stract definitions of religion and poli- 
tics were least common in the most 
racially prejudiced. On the other 
hand, Erikson and Eisenstein (1953) 
found that ethnic prejudice was not 
related to the number of hypotheses 
a person would put forward to am- 
biguous stimuli. Thus, though ethno- 
centrism may be related to formalized 
reasoning, it does not appear to be 
related to what might be called ‘‘crea- 
tive reasoning.’ This agrees with the 
results found to hold for the water jar 
experiments. 


Rorschach 


The experiment by Eriksen and 


Eisenstein (1953) included an at- 
tempt to get the S to accept certain 
concepts in the Rorschach. They 
found that the tendency to reject 
these concepts was inversely related 
to the number of hypotheses a person 
would offer to an ambiguous stimulus. 
In some ways, this finding is surpris- 
ing; one would not expect that readi- 
ness to accept the concepts offered 
would go with an ability to produce 
numerous concepts on demand. In- 
deed, what might be called ‘‘passive”’ 
tests of rigidity have not in other in- 
stances linked up to the ‘“‘creative’’ 
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ability to seek out solutions or pro- 
duce ideas. 

Gaier (1952) has followed up this 
line of reasoning by looking for rela- 
tionships between the Rorschach 
which he used to gauge anxiety, rigid- 
ity and negativism, and the kind of 
thinking (‘‘rote’”’ or ‘‘thought’’) pres- 
ent in a learning situation. His Ss 
were 11 college freshmen of approxi- 
mately equal ability who were inter- 
viewed about their reactions during 
class while hearing a recording of that 
class. Their first year examination 
performance was analyzed according 
to the type of cognitive processes they 
had used. Their Rorschach protocols 
were examined and ranked by two 
experts for anxiety (‘‘whether self en- 
grossed and concerned with personal 
adequacy’’), rigidity (F+ and A re- 
sponses) and negativism (white space 
responses). Anxiety was found to be 
significantly but negatively related to 
performance when comparing famil- 
iar and unfamiliar material (rho 
=-—.61) and positively with the 
amount of thought given to the self in 
class (rho = .60). There was a positive 
correlation between high rigidity rat- 
ing and rote recall (rho=.73) and a 
negative correlation between high 
rigidity rating and problems or ideas 
requiring new methods of attack. 
Gaier concludes that ‘Anxiety and 
rigidity characterize those individuals 
less capable of improvising in a new 
problem situation.’’ Of course there 
could be other reasons for the lack of 
creative thought, such as low ability, 
or low fluency of thought, and Gaier’s 
work gives no clue as to which comes 
first, rigidity or anxiety. 

The number of white space re- 
sponses on the Rorschach was also 
used by Bandura (1954) who found it 
to be related to the rate of fluctuation 
of the Necker cube—a favorite test of 
sensory perseveration if not of 
“rigidity.” 
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Fisher (1950) attempted to use a 
“rigidity profile’ consisting of the 
Vigotsky block test, change in level 
of aspiration during a hand steadiness 
test, liking for colors, trait judgments 
from photographs, comparison of self 
with pictured persons, number of in- 
terpretations to TAT pictures, num- 
ber of things found annoying, range 
of interests, Rosenzweig picture frus- 
tration test, and blot representation 
test. Against this profile he used the 
Rorschach. His first measure was of 
“personality rigidity,’’ diagnosed by 
such signs as restricted responses and 
limited use of color. His second meas- 
ure was of “personality maladjust- 
ment” as shown by various “clinically 
recognized signs.’’ He also used Guil- 
ford’s S.T.D.C.R. test, to get “a uni- 
form means of analyzing an Ss way of 
describing herself'’—that is, a meas- 
ure of the Ss favorable or unfavorable 
descriptions of herself. There were 60 
female Ss, divided into normal, 
paranoid, and hysteric groups. Very 
little overlap occurred between the 


subtests of the rigidity profile and 
there was no consistent overlap be- 
tween the different subject groups. 
This is hardly surprising, in view of 


the assorted mixture of the tests, 
some of which would not be generally 
recognized as measuring rigidity at 
all, but rather narrowness of outlook. 

He found that his rigidity profiles 
did correlate with the Rorschach 
measure of rigidity, and with the “un- 
favorable” S.T.D.C.R. score, and 
that in general, the more disturbed 
the patient, the higher her “rigidity” 
score. He makes the point already 
mentioned by Goldstein that rigid 
people in one situation may react by 
extreme flexibility in another and 
that it may be that “personality 
rigidity may manifest itself in diverse, 
often unrecognized ways.’ He sug- 
gests that three levels of emotional 
stress are involved in his tests. Some, 


SHEILA M. CHOWN 


like the Rorschach, cannot be an- 
swered without a great deal of emo- 
tional impact, while others, such as 
liking for colors, contain no emotional 
significance. Nothing is done to show 
whether this analysis is consistent 
with the data. 

In view of the assortment of tests 
used, it may well have appeared that 
no single one of these measures is suc- 
cessful as a measure of general rigid- 
ity, but Fisher was not justified in 
stating on the basis of his results that 
the concept of generalized rigidity is 
largely fictitious. 


Flexibility in Thinking 


In a factor analytic study of flexi- 
bility in thinking Guilford, Frick, 
Christenson and Merrifield (1957) 
found two flexibility factors called 
“spontaneous flexibility” (i.e. free- 
dom from inertia in thinking) and 
“adaptive flexibility” (i.e. restruc- 
turing of interpretations and ap- 
proaches). These factors are not 
thought likely to be the same as any 
psychomotor rigidity factor, but per- 
ceptual flexibility seems to be a form 
of adaptive flexibility. 

Thirty-two test scores were ob- 
tained from 28 tests, administered to 
208 air cadets. The test battery took 
six hours to complete. Eleven of the 
tests were new, and five were “ref- 
erence tests."’ The tests which had 
loadings of above .30 on spontaneous 
flexibility, and on this factor only, 
were: 

“Brick uses’’—think of as many 
different uses as possible for a com- 
mon brick. 

“Object naming’’—write as many 
objects as possible belonging to a cer- 
tain class. 

‘Unusual uses’’—list other uses for 
a common object for which a stand- 
ard use is given. 

“‘Impossibilities’’—list all the im- 
possibilities that can be thought of. 
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Tests having loadings above .30 on 
adaptive flexibility only were: 

“Match problems’’—remove_ a 
specified number of matches from a 
given design to leave another named 
design, by several methods. 

‘‘Penetration of camouflage’’—find 
human faces hidden in a series of pic- 
tures. 

‘““Squares’’—place a specified num- 
ber of crosses on a checkerboard so 
that no two are in the same row, 
column or diagonal—give alternative 
responses. 

Of these tests, only the “hidden 
faces” has any parallel with those 
previously discussed. 

Tests which have appeared in 
other research and which appear in 
this study in other factors than flexi- 
bility are: 

Hidden words (verbal comprehen- 
sion and structural redefinition). 

Water jars (reasoning and logical 
evaluation). 

Sign changes in arithmetic (reason- 
ing). 

Riddles (originality, if ‘‘clever,”’ 
and ideational fluency, if ‘‘obvious’’). 

It may well be that this distinction 
between spontaneous and adaptive 
flexibility in thinking is as im- 
portant as that drawn by Cattell 
between process rigidity and struc- 
tural rigidity in psychomotor per- 
formance. The distinction has al- 
ready been drawn more than once in 
this article between situations which 
impose a bounded area for adaptation 
and those which give a _ limitless 
choice and which seem to require 
some sort of creative ability. Adap- 
tive flexibility seems to resemble the 


‘ 


former case and spontaneous flexibil- 
ity, the latter. 


Summary and Discussion 


It should be noted that 47 tests 
have been mentioned, and that the 


overlap between experiments has 
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been extremely small. Where two 
people have used the same two tests, 
their results hardly ever agree and it 
is hard to say whether this is due to 
faults in the tests or discrepancies in 
the conditions, administration, and 
scoring of the tests. 

Perhaps the most obvious need in 
the whole field is for a study of the 
relationship between the well-known 
tests of rigidity—an Einstellung test, 
concept formation test, personality 
test, perceptual test, and disposition 
rigidity tests. However, the writer, 
like previous investigators, feels that 
the design and scoring of some of 
these tests should be altered. Even 
if no new tests were introduced into 
such a battery it would not be com- 
parable with previous batteries. It 
would be consistent within itself in 
that each test would seek to measure 
the difficulty of change of habits in the 
face of new demands to change. 

Cattell’s warning that each new 
aspect of rigidity studied should be 
linked to the factors already known 
should be remembered; for example, 
a measure of intelligence should be 
included in any battery to serve as an 
anchor for a factor analysis. 


E-XPERIMENTAL WoRK 
Training 


Many workers have tended to 
ignore the quest for a general factor 
of rigidity. They have concentrated 
on situations which elicit rigid be- 
havior and the methods by which 
such behavior may be modified. The 
first group of experiments described 
here deals with the effects of different 
“training experiences.’’ When a per- 
son has built up a concept as a result 
of his own experiences, under what 
circumstances and in what ways can 
he alter it most easily? 

T ype of alteration achieved. Several 
experiments have been carried out on 
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the type of alteration which is most 
easily achieved. Walk (1952) used 
wooden blocks of different size, shape, 
and color, and later photograghs of 
various facial expressions and poses 
in a concept formation test. He gave 
a third of his Ss training in one prin- 
ciple, a third training in the opposite 
principle, and a third no training. 
The first principle was then used for a 
criterion run. The group which had 
to reverse the principle learned dur- 
ing training gave the worst perform- 
ance on the criterion test. The experi- 
ments show that, as might be ex- 
pected, it is harder to alter a concept 
completely than it is to use the orig- 
inal one, or than it is to develop a new 
one. No tests were given in which 
training was of such a nature that the 
concept had to be partially altered on 
the criterion run. 

Buss (1950, 1952, 1953a, 1953b) 
gave college students a concept for- 
mation task which involved block 
sorting. He found that concept re- 
versal was easier than having to de- 
velop an unrelated concept or one 
overlapping with the original vari- 
able. Buss suggests that discrimina- 
tions of this type are learned by the 
differential reinforcement of the vari- 
ous stimuli. The number of stimuli 
of one kind presented in the learning 
series and the kind of blocks given 
in the criterion series both influence 
the success of the Ss. 

Kendler (1953) carried out similar 
experiments using the New York 
University card sorting test, and ob- 
tained similar results. However, he 
attempted to test out his own ex- 
planation of the easier reversal shift. 
This is that concept formation sorting 
tests depend on the ability to ver- 
balize the concepts. Those easier to 
verbalize will be learned more quickly. 
His crucial experiment involved omit- 
ting all the cards in the learning of the 
second concept which would give 
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partial reinforcement to the first con- 
cept. However, there was no signifi- 
cant difference between the reversal 
and nonreversal groups, and it proved 
difficult to evaluate the results since 
the correct sorting for both reverse 
shape and reverse color were the 
same. 

The important thing about this 
work is that Buss and Kendler ob- 
tained similar results using different 
materials and working independently. 
In the formation of concepts, it seems 
that direct reversals of concepts al- 
ready known are easier to learn than 
partial alterations. This may have a 
possible application in the field of 
learning and retraining. 

Effect of reinforcement. The effects 
of the type of reinforcement given 
during learning is not at all clear. 
Buss, in the work already quoted, 
found that continuous reinforcement 
led to Jess readiness to reverse the 
previously learned discrimination. 
However, Sheffield (1949) worked 
with rats and found that the rate of 
extinction of alley running habits was 
greater for those whose responses had 
been reinforced all the time. In other 
words, they appeared to give up their 
habit more readily if it had been con- 
tinuously reinforced and then sud- 
denly not reinforced at all. 

It seems that “‘reversal’’ and “‘ex- 
tinction” are different phenomena, 
but that continuous reinforcement 
leads the Ss to feel very sure of the 
“rightness” of the choices they are 
making. 

Effect of criterion run. The length 
of the successful criterion run also ap- 
pears to influence willingness to 
change a concept. 

Grant and Berg (1948) used the 
Wisconsin card sorting test and found 
that a group which had been allowed 
to continue longer with a series of 
consecutive correct responses had less 
difficulty in changing when the ex- 
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perimenter altered the correct sorting 
category without warning. A similar 
result was found by Kendler; those 
with the greatest number of correct 
responses dropped the response most 
quickly. On the other hand, they also 
proved quicker at relearning the same 
concept later. It seems that those 
who are surest of their concept in the 
first place are the quickest to realize 
that the concept has been changed 
and that they need to abandon the 
idea which was previously correct. 
Effect of ‘‘set.”’ In situations which 
involve ‘‘set’’ however, those who 
have become most practiced in the 
“‘set”’ method are often least willing 
to abandon it. The number of arith- 
metic problems given by Youtz 
(1948) to form the “‘set”’ ranged from 
five to 40. He then introduced a new 
series of problems which were in- 
soluble by the ‘‘set’’ method. He 
found that the greater the number of 
“‘set’’ problems the greater the time 
taken on the new type of problems. 
No data were given on the rate of 
increase of speed on the new prob- 
lems, and it seems likely that “‘in- 
sight’”” occurred fairly early in the 
“test’’ series. Maltzman and Mor- 
risett (1952) showed a similar effect 
to be present in the solving of in- 
compatible anagrams. On the other 
hand, Tresselt and Leeds (1953a) dis- 
covered no difference in the number 
of criterion problems solved when the 
number of “‘set’’ problems was in- 
creased beyond a certain limit. In 
their experiment, the best number of 
“set’’ problems proved to be between 
six and eight. It seems possible that 
a break in the series may have a 
definite effect here, in some way 
breaking the habit of response more 
for less practiced responses. Or again 
it may be that time taken is a more 
sensitive measure than number solved 
correctly, especially if it is a question 
of “‘insight’’ and realizing that a new 
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method of solution is needed. 

Both Rees and Israel (1935) and 
Hunter (1956) have shown that 
specified sets help in solving anagram 
problems. The more specific the set, 
the more quickly their Ss solved the 
anagrams. Weaver and Madden 
(1949) concluded that in solving 
problems such as Maier’s “pendu- 
lum,”’ Ss used only directions of 
thought and habits of search with 
which they were already familiar. 
Hints about “direction” and “‘parts”’ 
of the solution enabled more Ss to 
solve the problem. 

Effect of practice. The type of prac- 
tice seems to influence the degree of 
rigidity shown in problem solving, and 
the degree to which a “‘set”’ is devel- 
oped. For example, Kendler, Green- 
berg, and Richman (1952) showed 
that massed practice rather than 
distributed practice favored the de- 
velopment of a “set”’ in solving arith- 
metic problems. Half their Ss were 
given the problems under massed 
practice conditions with no interval 
between the solution of one problem 
and the presentation of the next, 
while the other half had a three min- 
ute interval between successive prob- 
lems. The strength of the set was 
measured by a test problem which 
could be solved by either the set or 
nonset method. Sheffield (1949) 
working with alley-running rats, 
found that the extinction rate was 
dependent upon the reinforcement in 
massed training but not in spaced 
training. Kendler (1953) found that 
mental sets were more effectively 
weakened when they were tested 
under massed practice conditions. 
His first 20 anagrams were all food 
words, all solvable in the same way. 
They were presented to three groups 
of 10 Ss at 15-second intervals. Dur- 
ing the test series, the Ss were given 
nature words not solvable in the 
“set’’ way, one group with no in- 
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terval between solution and presenta- 
tion, and the second group with an 
interval of 30 seconds. Over the first 
six anagrams of the test series, the first 
group had decidedly fewer failures 
than the second group. After this the 
difference decreased. Taken together, 
these experiments suggest that 
massed practice favors the develop- 
ment of set but that massed practice 
during testing or during extinction 
tends to favor extinction. Such find- 
ings do not necessarily apply to the 
solving of problems other than these 
short ones involving ‘‘set.”’ 

For instance, Shaklee and Jones 
(1953) divided three hundred college 
students into three groups and gave 
them a verbal reasoning problem. 
The first group received massed prac- 
tice for 20 minutes, the second re- 
ceived practice of 60 seconds work 
interspersed with 30 seconds rest, and 
the third, practice of 60 seconds work 
and 90 seconds rest. They found that 
fewer of the group with 30 seconds 
rest solved the problem, but that the 
time taken by those who reached a 
solution did not differ for the three 
groups. There was no relationship 
between the length of the rest periods 
and the number of incorrect solutions 
offered. 

Preparation for problem-solving. 
There has been a good deal of dis- 
cussion as to the best way in which 
to prepare people for problem solving 
tasks. Adams (1954) found that a 
group trained on repeated presenta- 
tions of the same problem was more 
proficient in solving new problems of 
the same class than a group trained 
on a number of different problems. 
The latter, however, had a smoother 
transition to new problems than the 
former. Harlow (1949) holds that 
training on a number of different 
problems leads to the growth of 
learning sets, or ideas of how prob- 
lems should be tackled, so that such 
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training actually widens adaptability. 
Schroder and Rotter (1952) agree 
with this. They used a card sorting 
task with four groups of Ss and they 
altered the training in expectancy of 
change given from group to group. 
Flexibility, they say, consists of ex- 
pecting changes and looking for al- 
ternative pathways all the time. 
Thus it is training in expectancy of 
change which is required, and not 
training in one single solution satis- 
factory at the moment. Just giving 
no instructions is not enough to pre- 
vent a “set” being built up: Postman 
and Leytham (1951) found that a se- 
lective set may be induced either by 
instructions or through experience on 
the task. 

Again, the probability of responses 
being correct was found to influence 
the solutions to matching problems 
by Goodnow and Postman (1955). 
In this task, the Ss were not told that 
the probabilities varied or what they 
were. 

Limited previous experience can 
itself have a harmful effect when new 
problems are faced. Duncker demon- 
strated that inability to use an object 
for a “‘strange’’ purpose may be due 
to the previous use made of the ob- 
ject. Birch and Rabinowitz (1951) 
also showed this effect, and three of 
Duncker’s experiments were repeated 
by Adamson (1952) with the same re- 
sults. Adamson and Taylor (1954) 
showed that this “functional fixed- 
ness’’ is related to inability to over- 
come set on the Luchins water jar 
test (i.e. the extinction problems), but 
not to susceptibility to set. They 
also found that the longer the time 
interval between the prior use of the 
“fixed” article, the more chance 
there was that the S would overcome 
the fixation and see it in its new con- 
text. Rokeach (1950) has shown that 
a delay of 10 to 60 seconds between 
presenting an Einstellung problem 
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and allowing the S to write down the 
answer cuts down the number of 
“rigid” answers received. This points 
to the importance of allowing time to 
think—a factor which may normally 
depend either on the imposed phys- 
ical situation, or on some personality 
variable. However, using intervals of 
one to seven days between practice 
and ‘‘critical’’ problems seemed to 
increase the susceptibility to set in 
an experiment by Tresselt and Leeds 
(1953b). Here, obviously, factors of 
memory are influencing the situa- 
tion. 

Summary and discussion. The ex- 
periments just described have dealt 
with the effects of previous experi- 
ence. 

It has been shown that reversing a 
‘ previous concept is easier than par- 
tially altering it and that both these 
are, of course, harder than sticking to 
the original concept. 

It has been demonstrated that 
specificity of set, or training in a par- 
ticular solution, leads to maximum 
efficiency so long as problems are con- 
fined to ones which require that solu- 
tion. Where a variety of problems 
have to be faced, a wider training 
with emphasis on the need for change 
is advisable. In this connection, the 
effects of limited experience and the 
previous use of an object in one con- 
text only have been illustrated, and it 
has been suggested that solutions to 
problems are, in fact, evolved from 
previous experience rather than 
thought out from first principles. 

The experiments have not settled 
all the problems that they have 
tackled. For example, the effects of 
different types of reinforcement are 
not entirely clear. In most of the ex- 
periments, reinforcement given all 
the time appears to make Ss surest 
of their concept. When the experi- 
menter changes his criterion, the Ss 
will drop the concept most quickly 
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and also relearn it most quickly if 
their responses have been reinforced 
all the time. But Buss found that 
Ss given 100% reinforcement while 
learning a concept appeared to be 
less ready to change it. It is not clear 
why this should be so, and further 
work is needed on this point. 

Again, the relative effect of massed 
and distributed practice on “set” 
problems is not certain. It seems 
from the experiments quoted that 
massed practice favors the develop- 
ment of ‘‘set,”’ yet it also tends to 
favor extinction later. The important 
variable here seems to be the point at 
which the massed practice is experi- 
enced. 

Similarly the effect of the length of 
the series of “‘set’’ problems in Ein- 
stellung tests is not clear. In some 
experiments, the longer the series, 
the greater the difficulty of change. 
In other experiments there appeared 
to be an optimum number for the 
“set’’ series. The conflict here seems 
to be due to the different methods of 
measuring “‘difficulty of change” and 
could probably be resolved fairly 
easily by the use of a ‘‘time for solu- 
tion” criterion instead of a ‘‘correct 
solution” criterion. 


State of the Subject 


Anxiety. The degree of rigidity in 
problem solving depends partly on 
the state of the S at the time, and 
anxiety is one factor which has been 
investigated. Some experimenters 
have induced anxiety by giving dis- 
turbing and false details about their 
Ss’ personality test results. Cowen 
(1952a, 1952b) used this technique. 
A variation in this procedure is to 
build up feelings of ego involvement 


in the task. Marks (1951), Osler 
(1954), Beier (1951), and Brown 
(1953) have used such methods. 


When anxiety was aroused, the degree 
of rigidity shown was found by Brown 
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to be related to authoritarianism. 

Other investigators, such as Maltz- 
mann, Fox, and Morrisett (1953) 
have examined the relationship be- 
tween Einstellung rigidity as meas- 
ured by the water jar test and an 
anagrams test and generalized anx- 
iety as measured by the Taylor 
Manifest Anxiety Scale. They found 
that the tendency to shift from the 
“set” solution was less for those with 
high anxiety scores. Applezweig 
(1954) found that the anxiety in- 
duced during the selection of sub- 
marine candidates affected their per- 
formance on rigidity tests, though it 
did not in all increase the 
amount of rigidity displayed. 

Ross, Rupel, and Grant (1952) 
gave the Wisconsin card sorting test 
under four different conditions and 
found that only irregularly given 
electric shocks affected the Ss’ ability 
to form and change the correct sort- 
ing concepts. The other conditions 
of testing were an impersonal situa- 
tion, personal heckling, and imper- 
sonal auditory distraction. The 
“shock” situation is the only one of 
these which seems likely to have 
caused anxiety. 

Needs. Reaction to ‘needs’ such 
as thirst or hunger have been found 
to be related to certain types of 
perceptually rigid behavior by Klein 
(1954) and by Postman and Crutch- 
field (1952). Klein demonstrated 
that those who could ignore a color 
word when naming another color in 
which the word was typed, also gave 
less thirst-associated words in re- 
sponse to a thirst cue word when 
they were actually suffering from 
thirst. Postman and Crutchfield 
demonstrated that the relationship 
between intensity of hunger and fre- 
quency of food responses depended 
on their Ss’ selective set for such 
responses. Both investigations found 
that needs could influence set and 
rigidity. 


cases 
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Sensory deprivation. McAndrew’s 
work (1948) on the deaf and blind 
should be mentioned here, though he 
uses the Lewinian concept of rigidity 
which means that his findings must 
be interpreted with caution. He 
worked with 25 deaf, 25 blind, and 
25 normal children. He found that 
the deaf and blind took longer to 
become tired or “satiated”’ with sim- 
ple modelling tasks and that the deaf 
had more difficulty in restructuring 
in classification tests. Deaf children 
more often rejected Rorschach cards 
than normal children and they gave 
more ‘‘all or none” reactions to a 
level of aspiration task. From this he 
deduced that both deaf and blind 
children displayed more “rigidity” 
than normal ones, and that rigidity 
is a positive function of isolation. De- 
prived children probably have nar- 
rower worlds and more limited vo- 
cabulary than normal ones and this 
lack of experience might well account 
for their poorer reclassifications and 
responses in tasks which require 
verbalization. 

Age. Kounin (1941) and Werner 
(1946b) have both investigated rigid- 
ity in children. Kounin came to the 
conclusion that it increased with age, 
and Werner that it decreased. Kou- 
nin holdsthe Lewinian view of rigidity 
He contrasted normal children with 
feeble-minded adults of the same IQ. 
Thus there is some doubt whether age 
or feeble-mindedness is the chief vari- 
able here. 

Bromley (1953) dealing with aged 
adults, found that they were unable 
to shift or hold two aspects of the 
same pattern simultaneously. In re- 
sponding to Raven’s Progressive 
Matrices they gave more personal as- 
sociation, more global ideas, and more 
simplified and concrete answers. 
Thus the aged may be called more 
“rigid.” 

Berg (1948) in a change of concept 
study using a card-sorting test, found 
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that 22 Ss between 58 and 73 were 
unable to shift concepts when the 
experimenter altered his criterion 
without warning. What is more, if the 
Ss did not at first hit on the same 
criterion as the experimenter they 
seemed unable to explore the other 
possible criteria. Among a group of 
college students, on the other hand, 
15 were successful and verbally recog- 
nized that the experimenter had 
shifted, 21 were successful though not 
sure what the experimenter had done, 
and 15 failed to shift or to recognize 
the experimenter’s actions. It seems 
that in this sort of activity, older 
people may be less good than younger 
ones—but they might have been able 
to perform as well if told to look for 
changes. Some support for this idea 
is provided by Heglin (1956). He 
studied the performance of different 
age groups on the water jar and al- 
phabet maze Einstellung tests, both 
with and without training. Training 
consisted of instructions to look out 
for and avoid the influence of a set 
approach. He found that before 
training the older Ss showed more set 
than the younger ones. After train- 
ing, however, his middle-aged group 
showed the least amount of set, the 
youngest group came next and the 
oldest group did worst. He makes the 
important point that the older the 
group of Ss, the longer the time taken 
to solve the problems both before and 
after training. 

The complexity of the problems 
used has been found to affect Ss dif- 
ferentially according to their age. 
Thus, Clay (1954), using groups of 
64 Ss under 25 and 64 over 55, showed 
that the performance of both groups 
was similar on the simplest task but 
that after an increase in complexity 
the older Ss took more time, were 
less accurate, and less active in cor- 
recting errors. Her Ss were asked to 
arrange counters, each stamped with 
a number from one to four, in rows 
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and columns on a squared board to 
give the marginal totals printed on 
the board. Four problems were given, 
varying from nine squares to 36. 

Fattu, Kapos, Ervin, and Mech 
(1954) found a negative but complex 
relation between age and problem 
solving on a gear train apparatus. 

Kay (1951) too, has found that 
whereas performance deteriorates 
comparatively slightly with age on a 
simple task, the deterioration is more 
marked on a more complex one which 
involved combining the tasks al- 
ready learned. Again both time and 
errors rose, and Kay traces one cause 
of the errors to inability to discard 
wrong procedures previously learned. 
His apparatus consisted of a row of 
10 lights placed above 10 Morse keys, 
to which they could be connected in 
any serial order. His Ss, ranging in 
age from 20 to 70, had first to learn 
two five-light series, to relearn each 
of the series, and then to put them 
together in an alternation test. 

Summary and discussion. Anxiety 
has been found to alter Ss perform- 
ance on rigidity tests, but there is no 
clear-cut relationship holding for all 
Ss and all tests. It is not surprising 
to find individual differences here, and 
the work by Brown (1953) suggests 
that a complex relationship may exist 
between degree of anxiety aroused, 
type of personality, and performance 
on rigidity tests. 

It is suggested that “needs’’ may 
influence the “‘set’”’ of the S and 
hence affect his rigidity on tests 
which happen to be related to the 
‘“‘needs”’ in question. 

Because of the stimulating effect 
of other people it seems likely that 
most people would be more rigid 
when isolated from their fellow-men. 
But it also seems likely that individ- 
ual differences in rigidity would still 
exist under conditions of sensory or 
mental isolation. 

As far as “age” is concerned, it 
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seems that both speed of performance 
and accuracy appear to deteriorate 
with age on problems which require 
thought. The differences become 
more marked the more complex the 
task involved. The cause of errors 
appears to be due in part to the in- 
ability to change “‘set’’ and to look 
for new ways of doing things. There 
is some evidence that training can 
help to inculcate a “set” of looking 
for improvements in methods, though 
the effect of training may not be 
successful with all 


similarly age 
groups. 

There does not appear to have 
been any work on the differential 


effects of age on anxiety and hence 
on rigidity. The effects of various 
training methods on groups of differ- 
ing ages is not known. Ways of 
altering ‘“‘set’’ have not been com- 
pared and it is not known which 
methods are most effective with par- 
ticular age groups. In itself, age is 
also an extra variable, but the writer's 
reason for becoming interested in 
rigidity at all was due to the convic- 
tion that the topic should have some- 
thing to offer in the study of learning 
difficulties among those of middle- 
age and over. While it has become 
evident that fundamental research 
will have to be carried out before 
tests of rigidity can be used in the 
study of age and learning, it may be 
possible to do the two things at once 
to some extent. Most of the gaps 
outlined above will have to remain 
for some time, however. 


DISCUSSION 


There seem to be three possible ap- 
proaches for the experimenter in the 
field of rigidity. 

1. At the moment, each test must 
be taken to stand alone. Failure on 
any one test could be used as an 
operational definition of rigidity. 
For those who wish for rapid results, 
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this may be the only course. Follow- 
ing it seems, however, to have led to 
the muddle in which the topic now 
stands. Indeed, if this approach is 
followed, we may have to call rigidity 
by other names, each specific to the 
area of behavior involved. For ex- 
ample, for learning tasks, it may be 
more useful to talk about ‘‘change of 
expectancy” or ‘‘change of set.”” It 
would be possible to use certain find- 
ings from the general field of rigidity 
—that intelligence and previous ex- 
perience have to be controlled, that 
emotional states of Ss are likely to 
affect performance (very important 
for Ss starting with feelings of in- 
feriority), and that our criterion for 
adaptability must involve a necessity 
for adaptation with penalties for fail- 
ure. 

2. The second possibility would be 
the most fundamental. This would 
be to adopt a theoretical approach to 
rigidity and endeavor to develop a 
test which met the theory in a satis- 
factory way. For example, it has al- 
ready been pointed out (by Cattell) 
that “‘rigidity’”’ has a physical mean- 
ing which has been accurately de- 
fined. The term refers to the linear 
relationship which holds for many 
materials between the deflecting force 
on a bar and the deflection which it 
brings about. This ratio stress/strain 
is called the “coefficient of rigidity.”’ 
It might be possible to apply this 
theory to the study of rigid behavior. 
Apparently no attempt has ever been 
made to measure the extent of any 
“given demand tochange”’ (‘‘stress’’). 
Comparison has always been _ be- 
tween individuals given the same 
test and not between the behavior of 
one individual under two demands 
to change. Ideally, both intra- and 
interindividual reactions to change 
should be compared. The effect of 
different tests on different individ- 
uals is not known, but if there is a 
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wide variation due to other factors 
such as intelligence involved in the 
tests, this may account in part for 
the inconclusive results so far ob- 
tained. The second course may, 
therefore, not be possible until we 
have more knowledge of the composi- 
tion of the individual rigidity tests 
most often used at present, and the 
second and third methods of ap- 
proach seem to be best regarded as 
complementary. 

3. In discussing rigidity it is easy 
to confound behavior with the under- 
lying influences on behavior. We 
have the dichotomy made clear by 
Cattell when he points out that the 
causes of rigid behavior are likely to 
be threefold at least—and to lie in 
lack of intelligence, personality dif- 
ficulties, and ‘‘disposition rigidity.” 
Goldstein too, has emphasized the 
difficulty, pointing out that the rigid 
individual may react by extreme 
flexibility as well as by extreme in- 
flexibility, while the Lewinians openly 
choose to regard rigidity as a cause 
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rather than as a form of behavior. 
Whether ‘‘behavior” or ‘‘cause’”’ will 
be of interest to us depends on the 
general problems which we wish to 
solve. We may start with a piece 
of behavior—say difficulty in learn- 
ing a new job—and attempt to an- 
alyze out the various causes. Or we 
may start with a factor like “‘disposi- 
tion rigidity’’ and study the wide 
number of situations which it will 
affect. Whichever approach we 
adopt, it is essential to have tests 
available which measure rigidity as 
accurately as possible, and a battery 
of tests which may show us how 
many things other than rigidity are 
helping to make up any piece of be- 
havior in which we are interested. It 
is for this reason that it seems desir- 
able, in spite of the many incon- 
clusive results scattered through the 
literature, to advocate the factor anal- 
ysis of a chosen battery of the 
rigidity tests described in this paper; 
work along these lines is now being 
carried out by the writer. 
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DATA DISTORTIONS DUE TO INHERENT 
DIFFERENTIAL SAMPLING 


ALEXANDER M. BUCHWALD 
Indiana University 


Frequently an experimenter is in- 
terested in a phenomenon (either a 
particular response or a relatively 
complex effect) which does not occur 
in all possible instances in his experi- 
ment. In such experiments it is usu- 
ally possible to obtain two different 
kinds of data: relative frequencies of 
occurrence of the phenomenon under 
the various experimental conditions 
and quantitative characteristics of 
those instances of the phenomenon 
which do occur. Quantitative char- 
acteristics include such diverse meas- 
ures as response latencies or ampli- 
tudes, increases in sensory thresholds 
for particular kinds of material, and 
increases in number of errors on a 
task following various treatments. 
This paper will discuss the effects of 
“inherent differential sampling’’ upon 
data in the form of quantitative 
characteristics and will present some 
suggestions on how to avoid errone- 
ous conclusions on the basis of such 
data. 

Inherent differential sampling des- 
ignates the state of affairs which ex- 
ists when the experimental condi- 
tions elicit the phenomenon from 
varying proportions of the Ss, and 
when the Ss who exhibit the phe- 
nomenon under some of the condi- 
tions tend to be unrepresentative of 
the entire sample. Put in a some- 
what different way, inherent differ- 
ential sampling occurs when: (a) 
there are differences among _ indi- 
viduals in the probability of occur- 
rence of the phenomenon, (6) prob- 
ability of occurrence is related over 
individuals to some quantitative 
characteristics of the phenomenon, 
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and (c) the experimental conditions 
vary widely in the sets of probabil- 
ities of occurrence which they en- 
gender in the Ss. Under these cir- 
cumstances some of the experimental 
conditions tend to elicit instances of 
the phenomenon which are sampled 
largely from a restricted portion of 
the distribution of quantitative char- 
acteristics, while others do not. 

To illustrate how inherent differ- 
ential sampling can distort the rela- 
tionship between quantitative char- 
acteristics and experimental condi- 
tions consider a simple hypothetical 
experiment on stimulus generaliza- 
tion. Ss are trained to press a key 
to Stimulus 1, and are then tested 
once each on Stimuli 1, 2, 3, 4, and 5 
which are increasingly dissimilar 
from Stimulus 1 in that order. For 
simplicity assume that all Ss respond 
to Stimulus 1; that no Ss who re- 
spond to any given stimulus fail to 
respond to all stimuli which are more 
similar to Stimulus 1; and that all Ss 
who respond to a given number of 
stimuli have identical latencies. 
Table 1 shows data for this hypo- 
thetical experiment. These latencies 
have been assigned in accordance 
with the following assumptions: (a) 
For each S latencies increase as the 
stimuli become less similar to Stim- 
ulus 1. (8) Individual latencies to 
Stimulus 1 are negatively related to 
the most dissimilar stimulus to which 
S responds. Note that despite As- 
sumption (a) the mean latencies de- 
crease from Stimulus 1 through 
Stimulus 5 and present a gradient of 
‘“‘response strength’’ which is nega- 
tively related to that presented by 
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the relative frequencies of occurrence 
of the response. The decrease in 
mean latency with increasing dissim- 
ilarity from the training stimulus is 
not a necessary consequence of the 
chosen assumptions. Within their 
framework it would have been possi- 
ble to assign latency values so as to 
produce almost any pattern of mean 
latencies. The particular pattern 
produced was selected because it is 
similar to results reported from sev- 
eral studies of stimulus generaliza- 
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is typically a small number, many Ss 
will make no responses to such a 
stimulus. But if rapid responders 
tend generally to have larger Pij's 
than slow responders the rapid re- 
sponders will provide increasingly 
large proportions of the latency data 
as the stimuli become increasingly 
dissimilar from the training stimulus 
and hence as the entire set of Pij’s 
become increasingly smaller. Thus 
the mean response latencies to the 
various stimuli will be affected by 


TABLE 1 


LATENCY DATA FROM A HYPOTHETICAL EXPERIMENT ON STIMULUS GENERALIZATION 


Number of Ss 
Responding 


1 2 
40 300 
5 390.3 429.4 
10 443.1 487.6 
15 480 .6 557.9 
30 509.7 — 
Mean Latency 408.8 408.5 
Proportion of Ss 


Responding 


330.1 


70 


& Baron, 
Rosenbaum, 


Bilodeau, 
1939; 


(Brown, 
Gibson, 


tion 
1951; 
1953). 
The problem posed by inherent 
differential sampling does not depend 
on each S being tested only once on 
each stimulus, nor on a perfect or- 
dering of the stimuli in ability to elic- 
it the response. In any experiment 
of this kind where each S is tested n 
times on each of A stimuli the prob- 
lem can also arise. Let Pij designate 
the probability that the ith S will 
respond to the jth stimulus. For the 
training stimulus, or for closely sim- 
ilar stimuli, the Pzj’s will tend to be 
large and most Ss will exhibit at 
least one response to such stimuli. 
But for stimuli widely different from 
the training stimulus the values of 
Pij are apt to be small and, since n 


Stimulus Number 


5 


369.9 





inherent differential sampling and 
the relationship between the mean 
latencies and the various stimuli 
will be a distortion of the relationship 
that holds for individuals.! 

Before discussing ways of remov- 
ing the effects of inherent differential 
sampling from data it seems advisa- 
ble to give a further example of an 
experiment in which inherent differ- 
ential sampling may be a problem. 
Many such experiments will be those 
in which E£ is interested in testing 
hypotheses of the form: (a) is a cer- 
tain effect produced more frequently 


i In this kind of experiment E sometimes 
assigns an arbitrary value to serve as latencies 
for trials on which no response occurs. Such a 
procedure will distort the relationship which 
exists between latencies and stimulus dimen- 
sions, 
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by some conditions than by others, 
and (6) is the magnitude of the effect 
when it is produced greater under 
certain conditions than under others. 
An example of such an experiment is 
one where F£ is interested in whether 
various ‘‘stress-producing’’ condi- 
tions differ in the extent to which 
they disrupt performance. Having 
each S perform under control condi- 
tions and under one or more of the 
stress conditions may permit E to 
identify Ss whose performance is 
disrupted, e.g., those who make 
more errors under stress than before 
stress. The proportion of Ss show- 
ing the disruption effect under each 
condition can be used to test a hy- 
pothesis of the form of (a). To test 
a hypothesis of the form of (6) it is 
necessary to compare the mean pre- 
and poststress differences for Ss 
showing the disruption effect under 
the various conditions. But this 
measure is subject to the possibility 
of inherent differential sampling. 
Conditions which lead to disruption 
for almost all Ss may elicit small ef- 
fects from Ss who would not exhibit 
a disruption effect under other condi- 
tions, whereas those Ss who exhibit a 
disruption effect under weaker con- 
ditions may in general exhibit large 
disruption effects. 

In some experiments it may be pos- 
sible to test whether inherent differ- 
ential sampling does or does not af- 
fect the results. In the example un- 
der discussion differential inherent 
sampling would lead to unequal vari- 
ances for the magnitude of the dis- 
ruption effect under the different 
conditions. Similarly for the experi- 
ment exemplified in Table 1 differ- 
ences among the subgroups in re- 
sponse latencies to Stimulus 1 would 
indicate that inherent differential 
sampling would be a factor. Unfor- 
tunately a lack of significant differ- 
ences in the latter case would not 
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rule out the possibility of inherent 
differential sampling since a relation- 
ship between the probability of re- 
sponses to generalization stimuli and 
rapidity of responding might still 
exist. 


WAYS OF OVERCOMING DISTORTION 
DvE TO INHERENT DIFFERENTIAL 
SAMPLING 


Since inherent differential sam- 
pling poses problems only in the anal- 
ysis of quantitative characteristics 
the difficulty can be avoided by using 
relative frequency data. However 
hypotheses of interest cannot always 
be tested by such data. As suggested 
in the preceding paragraph there is 
a class of hypotheses which demands 
that £ lay himself open to the vicissi- 
tudes of inherent differential sam- 
pling. 

Even though it may be impossible 
to avoid the presence of inherent dif- 
ferential sampling it may be possible 
to avoid contamination of the data 
from this source if the Ss can be strat- 
ified in such a way that inherent dif- 
ferential sampling does not operate 
within strata. The surest way of ac- 
complishing this is to expose each S 
to more than one, and preferably to 
all, of the experimental conditions. 
When this can be done one assem- 
bles all Ss who exhibit the phenom- 
enon under the same subset of condi- 
tions into a subgroup. Within each 
such subgroup the problem of inher- 
ent differential sampling will not 
exist since each S will contribute data 
under each condition.? Further data 
treatment should be carried out for 
each subgroup separately. If the 


2 In some experiments some of the Ss may 
contribute more data than others, e.g. some 
Ss might respond on more trials under a given 
condition than others. This difficulty is easily 
overcome by using a mean or median value 
for each S if more than one instance of the 
phenomenon can occur for a single S under 


one condition. , 











functional form of the relationship 
between experimental conditions and 
quantitative characteristics is identi- 
cal for each S it will frequently be ob- 
tainable from the data for each sub- 
group, or at least from some of them. 
(The general question of the relation- 
ship between group and individual 
curves is beyond the scope of this 
paper but a discussion of it can be 
found in an article by Estes [1956].) 
Application of this procedure to the 
data in Table 1 would involve treat- 
ing the data on each line separately, 
and such treatment will yield the re- 
lationship used to produce the tabled 
values. 

In some experiments it may be im- 
possible to expose Ss to more than 
one condition. When this is the case 
stratification can be accomplished 
only indirectly, if at all. If there is 
known to be another variable, per- 
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haps a test score, which is related to 
the probability that an individual 
will display the phenomenon, Ss can 
be stratified on this basis, and the 
data for subgroups assembled on the 
basis of similar scores on the external 
variable can be compared. The ex- 
tent to which such a procedure will 
be effective in eliminating inherent 
differential sampling will depend, of 
course, on the degree of relationship 
between the external variable used 
and probability of occurrence of the 
phenomenon. 

Finally, there may be some prob- 
lems for which no amount of experi- 
mental ingenuity will be sufficient to 
design an adequate experiment in 
which Ss can be exposed to more 
than one condition, and where no 
highly correlated external variables 
can be found. For these problems the 
author knows of no existing solution. 
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Factor analysis was developed pri- 
marily by psychologists. Apart from 
the very early debates (Pearson & 
Moul, 1927; Spearman & Holzinger: 
1924, 1925, 1929; Wishart, 1928) 
about the standard errors of tetrad- 
differences, development was along 
mathematical rather than along sta- 
tistical lines. This is apparent from 
Thurstone’s' writings (Thurstone: 
1935, 1947), for example, where lib- 
eral use is made of matrix algebra 
and where (1947, p. 282 et seq.) the 
rank of the side matrix, rather than 
any sampling error considerations, is 
the guiding principle in determining 
the “dimensionality”’ of the common 
factor space. 

The first attempt by a statistician 
to deal with a problem resembling 
that of the factor analyst came in 
1933 with Hotelling’s paper (1933) on 
the ‘Analysis of a Complex of Statis- 
tical Variables into Principal Com- 
ponents,” though, as Burt often 
points out, Karl Pearson had dealt 
with a closely related problem as 
early as 1901 in a paper entitled “On 
Lines and Planes of Closest Fit to a 
System of Points in Space.”’ But a 
principal component analysis, being 
only an empirical method of ‘‘break- 
ing down” a covariance or correla- 
tion matrix into orthogonal com- 
ponents, equal in number to the num- 
ber of variables, must not be con- 
fused—as Bartlett (1953), and as 
Kendall and Lawley (1956) remind 
us—with a factor analysis of the 
same matrix. In a factor analysis, by 
contrast, there is a basic assumption 
that the scores obtained by Ss on the 
variables concerned can be explained 
in terms of a few ‘“‘common factors,” 
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plus a specific factor for each vari- 
able, and this implies that some hy- 
pothesis about the number of “‘fac- 
tors’’ necessary and sufficient to ac- 
count for the covariance between the 
variables is being tested when a factor 
analysis of a covariance or correlation 
matrix is performed. Consequently, 
it was not until Lawley, who had 
become familiar with the factor prob- 
lem proper during his association with 
Godfrey Thomson, produced his clas- 
sic paper in 1940, on ‘‘the estimation 
of factor loadings by the method of 
maximum likelihood,” that a rigorous 
statistical solution of the factor prob- 
lem was available and factor analysis 
became a respectable statistical tech- 
nique. Lawley’s paper was not ac- 
cepted without criticism (Kendall & 
Lawley, 1956; Young, 1941) at first, 
but it served the immediate purpose 
of stating the factor problem in lan- 
guage which mathematical statis- 
ticians could understand, and though 
in the intervening years since 1940 
only a few steps forward have been 
achieved, many statisticians have 
turned their attention to factor 
analysis (Anderson & Rubin, 1955; 
Danford, 1953; Kendall, 1954; Law- 
ley, 1940; Rao, 1955; Rippe, 1953; 
Wold, 1953) attracted as much by the 
complexity of the problems involved 
as by the intrinsic interest of the sub- 
ject itself. 

Previous to Lawley’s 1940 paper, 
and during the subsequent decade— 
while factor analysis was still await- 
ing recognition as an appropriate 
field for statistical research—psy- 
chologists were busy developing the 
subject in an empirical manner. As a 
consequence, a vast number and 
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variety of factorial techniques ap- 
peared in psychometric journals, not 
only for extracting factors and for 
testing the significance of residual 
matrices, but also for rotating factors 
into “psychologically meaningful” po- 
sitions and for comparing factors de- 
rived from different studies. These 
techniques, logically derived or em- 
pirically justified, became the stock 
in trade of practicing factor analysts. 
While they served, on the whole, a 
fairly useful purpose and are likely 
to continue in use in pilot studies 
with small samples, it is probable that 
future historians will be severely 
critical of them and of their users; 
critical of the techniques because of 
their approximate nature and lack of 
precision and statistical efficiency, 
and of their users for extravagant 
claims on their behalf. Indeed, even 
amongst psychologists, who them- 
selves occasionally use factorial tech- 
niques, criticism to date has not been 
lacking (Burt, 1952; Holzinger: 1940, 
1942; McNemar, 1941; Maxwell, 
1956; Reyburn, 1943; Saunders, 1948). 
In view of these salutary rebukes it is 
surprising that such efficient factorial 
techniques as do exist are not more 
frequently employed. This appears 
to be due in part to the intrinsic com- 
plexity of the basic factor equations 
themselves, for which no efficient yet 
simple solution has yet been found: 
and in part to the onerous nature of 
the calculations involved in the rigor- 
ous procedures at present available 
(Howe, 1955; Lawley: 1940, 1953, 
1958; Rippe, 1953). However, with 
the advent of electronic computers 
the latter excuse is no longer valid. 
Consequently this paper aims at 
bringing to the psychologist’s atten- 
tion some (large sample) statistical 
methods which should be of value to 
him. Some of these are only tangen- 
tially associated with factor analysis, 
others are directly associated with it. 
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SIGNIFICANCE OF A CORRELATION 
MATRIX 

Occasionally an investigator (Ey- 
senck, 1952, pp. 12-14) wishes to 
test a correlation matrix for signif- 
icance, that is, he wishes to assess the 
statistical significance of the entire 
correlation structure of his variables. 
This would be especially desirable if 
the intercorrelations proved to be 
very low and it had to be decided 
whether or not a factor analysis of 
the matrix was justified (Bannatyne, 
1953). The appropriate test for the 
significance of a correlation matrix 
was given in 1950 by M. S. Bartlett 
with acknowledgement of earlier 
work by Wilks (1932). It is a chi- 
square test, namely, 


x?= —(n—(2p+5)/6)In| R| [1] 


with $p(p—1) degrees of freedom, 
where N is the sample size and n= 
(N—1), pis the number of variables, 
| R| is the determinant of the correla- 
tion matrix R, and “In”’ stands for 
log,. The laborious part of this calcu- 
lation is the determination of | R}, 
for which an electronic computer 
would be very desirable if p were 
greater than ten or so. However, if 
the approximation 


| R| ate D 37, i<j, 
(i, j,=1, 2,°°: p) 


due to Lawley (1940), is employed, 
where 7;; refers to the correlation co- 
efficients, the labor involved in using 
Formula [1] is greatly reduced. It 
would appear too (Bartlett, 1950, p. 
84), that this approximation is re- 
markably accurate even for moder- 
ately large samples, and the calcula- 
tions could be performed on an or- 
dinary desk calculator. Lawley 
writes the formula in the form, 


x=n Z. 753", i<j, [2] 
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for it would appear that the multi- 
plier —(n—(2p+5)/6) is a bit dubi- 
ous. 


COMPARING TWO VARIANCE- 
COVARIANCE MATRICES 


The need for a valid test of the dif- 
ferences between two or more vari- 
ance-covariance matrices is obvious 
to the factor analyst at least, for if 
two such matrices containing the 
same variables (and them alone) dif- 
fer, then their factorial composition 
must also differ. It is desirable too to 
do such a test on the covariance 
rather than on the correlation mat- 
rices, for a comparison of the latter 
would be invalidated if the sample 
variances, which are employed when 
converting the covariances into cor- 
relations, themselves differ signif- 
icantly. 

The test may be deduced from 
Wilk’s paper (1932). Let the two 
samples concerned have degrees of 
freedom m and mz respectively, i.e. 
the sample sizes are (m,+1), and 
(m2+1). Let A; and Az be the re- 
spective covariance matrices with p 
variates each. Let the ‘‘pooled”’ co- 
variance matrix be A, with (+2) 
degrees of freedom: i.e. A =(m,4,+ 
m2A2)/(m+m2), or the elements of 
Matrix A are weighted means of the 
elements of Matrices A; and Az. 
Then the quantity 


(m+-m2) In | A| —m In | Ay | 
— Mn In | Az [3] 


is distributed, on the null hypothesis, 
approximately as chi square with 
$p(p+1) degrees of freedom, where 
p refers to tests: | A|, | Ai] and | A,| 
stand for the values of the deter- 
minants of the matrices A, A; and 
Az respectively. Considerations by 
Box (1949) show that this test can 
be made more sensitive in the case of 
moderately large samples by apply- 
ing the multiplier, 
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, ee 
6(p + 1) 
x ( 1 4 1 1 ) 
Mm NM, MmtM 
to the expression for chi square given 
above. 

It is easy to see how Expression 
[3] would have to be adjusted if 
k rather than just two covariance 
matrices were being compared: Box 


(1949) actually considers this more 
general case. 


THE SIGNIFICANCE OF RESIDUAL 
MATRICES 

Closely associated with the prob- 
lem of finding efficient estimates of 
factor loadings, which the maximum 
likelihood method provides, is the 
problem of testing the hypothesis 
concerning the number of factors 
necessary to “explain” the covariation 
between the variables. The first 
satisfactory answer to this problem 
was also given by Lawley in his 1940 
paper in the form of a chi-square test 
of the significance of the residual 
matrix after k factors had been fitted. 
The exact formula is 


xt=n In | C|/| A| [4] 
with 3 (p—k)?—3(p+k) degrees of 


freedom, where A is the observed 
correlation matrix and C is the fitted 
correlation matrix given by C=LL’+ 
V, L being the matrix of factor load- 
ings, L’ its transpose, and V the 
diagonal matrix of specific variance 
(inclusive of error variance). The 
evaluation of the determinants of the 
Matrices C and A, namely | C| and 
|A|, each of order p, without elec- 
tronic computing aid, has in the past 
made the use of Formula [4] imprac- 
ticable. However, there is an approxi- 
mation to it, namely, 


xt=n- Do (r:i?/8,2577), i<j, 


(i, j7=1, 2,-++ p) [5] 
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where r; is the residual in the ith row 
and jth column of the residual ma- 
trix, s;? is the specific variance (in- 
cluding error) of the ith test, and 
N=n+1 is the sample size, which 
can be evaluated readily on a desk 
computer. Formula [2] is the limiting 
case of Formula [5] when & is taken 
as zero, that is when no factors are 
postulated and the correlation struc- 
ture is assumed not to differ signif- 
icantly from zero. 

In his 1940 paper Lawley em- 
phasized that the chi-square test 
given by Formulae [4] and [5] should 
not be employed for testing the signif- 
icance of a matrix of residuals except 
in the case where efficient estimates 
of the factor loadings had been ob- 
tained. This ruled out its use, for a 
time, where the centroid method of 
analysis was concerned; latterly this 
restriction has been removed. In a 
paper entitled “A Statistical Exami- 
nation of the Centroid Method,” Law- 
ley (1955) examines the statistical 
efficiency of the latter method of 
analysis (in the special case where 
the communalities are taken as 
given), and obtains a test of signif- 
icance of the residuals. Not only 
was the statistical efficiency of the 
centroid method for the special case 
considered found to be very high, 
but also the significance test proved 
to be exactly that given earlier for 
use with the maximum likelihood 
method. This is an encouraging dis- 
covery, and it is safe to conclude that 
when care is taken with the centroid 
method to ensure that maximum 
variance is extracted with each suc- 
ceeding factor—a result which is not 
always achieved by following Thur- 
stone’s rules of thumb for sign reflec- 
tion, as Thurstone himself knew very 
well—and when a few iterations have 
been performed to ensure that the 
loadings are well on the way to con- 
vergence, then Formula [5] above 
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provides a valid and efficient test of 
the significance of the residual ma- 
trix. 

Lawley’s approximate chi-square 
test would then appear preferable for 
use with the centroid method even to 
the general test of ‘the complete- 
ness of factor solutions’ given by 
Rippe (1953). Rippe’s test has not 
been widely used to date primarily 
because it involves the laborious task 
of inverting a large covariance ma- 
trix. But even if we assume that with 
electronic computers this is no longer 
an obstacle, there is also the fact, 
which Rippe points out, that the tests 
given above are more powerful than 
his. 

Previous to Rippe’s and Lawley’s 
work, psychologists, using the cen- 
troid method, had to be content with 
empirical tests for deciding the ques- 
tion of “how many factors?” Of the 
numerous empirical tests in current 
use two may be mentioned—one due 
to McNemar (1942), and one due to 
Burt (1952). McNemar’s test is too 
widely known to need description 


here. Burt’s is a chi-square test, 
namely, 

x*=(N—3) 2) (s—2)*, [6] 
with 4(p—k)?—43(p+k) degrees of 


freedom, where N is the sample size, 
p the number of variables and k the 
number of factors extracted; z refers 
to Fisher’s transformation of the cor- 
relation coefficient and 2 to a similar 
transformation of the amount of the 
correlation accounted for by the fac- 
tors extracted. 

At first sight, Burt’s test might ap- 
pear to have considerable validity. 
Justifying it he states (1952, pp. 
126-127) that since ‘‘the distribution 
of 2x/(N—3) is nearly normal, with 
unit deviation, it follows that on 
summing the squares for all the dis- 
crepancies and multiplying the result 
by (N-—3), we obtain a quantity 
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which will be distributed approxi- 
mately as chi-square.”’ But this is a 
good example of how deceptive em- 
pirical and approximate methods can 
be. For one thing, it is likely that z 
and Z would be correlated, in which 
case a quadratic form of these ex- 
pressions would be required in For- 
mula [6]; then again nothing is shown 
about the distribution of the ‘‘corre- 
lations’ as reconstructed from the 
factor loadings, so that we do not 
know whether the z-transformation is 
appropriate or necessary. As a check 
on the efficiency of Burt’s test the 
value of chi square obtained by it for 
11 residual matrices (Maxwell, 1957) 
from studies for which the value of 
chi square obtained by Formula [5] 
above had previously been found, 
were compared. The figures are 
given in Table 1. In the last column 
of the table an L is entered when 
McNemar’s test agrees with Law- 
ley’s test, and a B when it agrees 
with Burt’s test; in other cases a 
sufficient number of factors had not 
been extracted to decide the issue. 


TABLE 1 
VALUES OF CHI SQUARE OBTAINED 
BY LAWLEy’s Exact TEST, AND 
Burt’s APPROXIMATE TEST 











Lawley’s Burt's eo 
Sample df Test SOS 
i" 2 Test* 

1 24 129.4 94.5 —_— 
2 16 63.3 21.4 L, 
3 13 24.2 10.4 i 
4 13 20.3 8.5 — 
5 26 43.4 23.8 5 
6 26 50.9 23:5 i 
7 24 238.3 83.0 — 
8 16 168.9 42.8 _— 
9 13 28.9 10.2 B 
10 13 29.0 10.4 # 
11 12 91.0 71.7 B 





* L indicates that McNemar’s Test agrees 
with Lawley’s and B that it agrees with 
Burt's. 
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This table shows that the results 
given by Burt’s approximate test 
are of a different order from those 
given by Lawley’s exact test, and 
in each case grossly underestimate . 
the value of chi square. McNemar’s 
test shows up in a slightly better 
light. 


VARIANCES AND COVARIANCES OF 
EsTIMATED LOADINGS 


A simple method of estimating the 
standard errors of factor loadings 
would be of inestimable value to 
factor analysts, but unfortunately 
none has yet been found. The first 
serious attempt to meet the need 
appeared in 1949 in a paper entitled 
“problems in factor analysis,” again 
by Lawley (1949). The expressions 
involved were very manageable and 
were used by Emmett (1949) in a 
study reported in the same year. Un- 
fortunately, it was later revealed that 
these expressions are not wholly satis- 
factory, for ‘‘the estimated loadings 
obtained for a given set of tests in 
one factor are slightly correlated 
with estimated loadings for the same 
set of tests in another factor.” 

In 1953 Lawley returned to the 
problem of the estimation of the vari- 
ances and covariances of factor load- 
ings in his Uppsala paper on “A 
Modified Method of Estimation in 
Factor Analysis and Some Large 
Sample Results,” where new expres- 
sions are given for estimating the re- 
quired quantities. Unfortunately, 
these expressions are not only diffi- 
cult and cumbersome, due to the 
complexity of the problem, but are 
of limited practical value in that they 
are subject to the restriction that the 
specific variances (or their ratio) of 
the variables concerned are assumed 
to be known. As such knowledge is in 
general denied us we are still in the 
position of having no valid method 
for estimating the standard errors of 
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factor loadings. This is a serious gap 
in the statistical theory of factor 
analysis. Lacking knowledge of the 
magnitude of the standard errors of 
factor loadings we are not in a posi- 
tion to decide in any particular case 
if a specific loading is significantly 
different from zero. This means that 
claims by factor analysts that they 
have, for example, “rotated their 
factors to simple structure,” carry 
with them little conviction, for ‘‘sim- 
ple structure” implies that certain 
loadings are zero. If it proves to be 
the case, as McNemar (1941) and 
others have suggested, that the 
standard errors of loadings, especially 
with small samples, tend to be large, 
that is, that the precision of most fac- 
tor studies is low, then the ‘‘confi- 
dence limits’’ of any given ‘“‘simple 
structure”’ are likely to be wide, and 
as a result that simple structure may 
readily be achieved. Under condi- 
tions of low precision, however, such 
an achievement is of little worth. 
Consequently, and in view of the fact 
that the “simple structure’ concept 
itself does not seem capable of rigo- 
rous mathematical or statistical defi- 
nition (Uppsala Symposium on Psy- 
chological Factor Analysis, 1953, p. 
77 et seq.), it is doubtful whether it is 
worthy of retention as a precise con- 
cept in a valid and efficient statistical 
theory of factor analysis. This recom- 
mendation, which in some quarters is 
likely to be considered irreverent if 
not downright heretical, is made on 
the strength of a noteworthy advance 
in the statistical theory of factor 
analysis, by Anderson and Rubin 
(1955) and by Howe (1955). 


‘ 


Factor IDENTIFICATION AND 
FACTORIAL INVARIANCE 


In the contributions just referred 
to, the problems involved in the esti- 
mation of factor loadings, under vari- 
ous different assumptions, are dis- 
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cussed, as also is the problem of de- 
ciding on the number of factors. 
Howe, too, devotes valuable space to 
the practical issues involved in the 
solution of the fundamental factor 
equations; in particular to the itera- 
tive processes most likely to be of ad- 
vantage. But our reference to the 
work of Anderson and Rubin and of 
Howe is a specific one. 

It is well known that though the 
question of the number of factors 
necessary to explain the correlations 
in a matrix can now be answered, this 
does not enable the factor loadings to 
be defined uniquely. The factor axes 
themselves may be rotated into an in- 
numerable number of different posi- 
tions within the factor space, any one 
of which is as good as any other as far 
as the reproduction of the correlation 
matrix is concerned. The position 
finally chosen in any study is that 
which the analyst thinks is most 
meaningful from a_ psychological 
viewpoint. The arbitrariness of such 
a procedure has already been noted. 
The important additional contribu- 
tion which the writers just mentioned 
have made is this: if previous to an 
analysis the experimenter can say 
that, for the factors postulated, cer- 
tain factor loadings will be zero, that 
is if he can specify a particular “‘sim- 
ple structure,”’ and if these zeros are 
sufficiently numerous and conven- 
iently placed, these conditions are 
sufficient to determine the factor so- 
lution completely and validly. Howe 
gives a worked example. 

Unaware of the researches of Howe 
and of Anderson and Rubin, Lawley 
(1958) was busy on the same problem 
and has produced a solution very 
similar to that given by Howe. 

The practical difficulty for the psy- 
chologist of postulating in advance 
not only the number of factors which 
he expects from an analysis, but also 
of specifying the positions of the zero 
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loadings, cannot be underestimated; 
but, as Howe points out, in cases 
where he is in doubt it would always 
be possible, by dividing his sample 
randomly in half, to carry out two 
parallel studies. He could then ex- 
periment with rotations of the factors 
derived from one of the studies and, 
using the information thus derived, 
set up hypotheses regarding the num- 
ber and positions of the zero loadings, 
which he could then cross-validate on 
the data from the parallel study. 

The justification for recommending 
procedures such as that outlined by 
Howe, which involve long and rather 
tortuous calculations lies—as_ sug- 
gested at the beginning of this pa- 
per—in the computational facilities 
which electronic machines now offer. 
However, these procedures are com- 
mendable primarily on the grounds 
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that they have a valid statistical ba- 
sis. The time surely has come for fac- 
tor analysts to assemble and use the 
few precision instruments available 
to them, at least in research work pre- 
pared for publication, though in pilot 
studies the less precise instruments 
are likely to continue to be of value. 


SUMMARY 


Some exact large sample statistical 
techniques for factor analysts are re- 
viewed. A recent investigation of the 
efficiency of the centroid method of 
analysis is noted, and a recommen- 
dation is made regarding a valid test 
of the significance of residual mat- 
rices when this method is employed. 
The “simple structure’’ concept is crit- 
icised and an alternative approach 
in the search for psychologically 
meaningful factors is recommended. 
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THE “EIDETIC IMAGE” AND “HALLUCINATORY” 





BEHAVIOR: 


A SUGGESTION FOR FURTHER RESEARCH 


THEODORE XENOPHON BARBER! 
Department of Social Relations, Harvard University 


After an extensive series of inves- 
tigations, Jaensch (1930) concluded 
that the ‘‘eidetic image’”’ is neither 
afterimage nor ‘‘memory image”’ but 
shares some of the characteristics of 
both. Although many other investi- 
gators (e.g., Busse, 1920; Meenes & 
Morton, 1936; Peck & Hodges, 1937) 
accepted this conclusion, Allport 
(1928) raised a dissenting voice, con- 
cluding from his own experiments 
and from a review of the literature 
that the ‘‘eidetic image”’ is a vivid 
“memory image.’’ To complicate 
matters further, Morsh and Abbott 
(1945) recently reported, after an in- 
vestigation utilizing 700 Ss, that the 
“eidetic image’ is a type of after- 
image. Investigators have not only 
disagreed on the interpretation of this 
phenomenon, they have also disa- 
greed on the primary data. Some in- 
vestigators report that practically all 
individuals in a ‘‘large’’ population— 
e.g., Fischer and Hirschberg’s (1924) 


while other investigators report that 
none of their Ss, in an equally large 
population, this type of 
“imagery.’’ Nevertheless, we can ac- 
cept all of these contradictory re- 
ports: investigators have subsumed a 
number of phenomena under the 
general rubric of the “eidetic im- 
age’! In some cases it is unlikely 
that the term has referred to any 
type of imaginal behavior at all; in 
other cases, the ‘‘eidetic image’’ has 
clearly referred to a negative or posi- 


possess 
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tive afterimage; in still other in- 
stances, the term has referred to 
what can best be conceptualized as a 
type of “hallucinatory’’ behavior. 
Investigations of this phenomenon 
usually proceeded as follows: (a) The 
experimenters first had the Ss experi- 
ence physiological afterimages. Ac- 
cording to E. R. Jaensch (1930), this 
was done in order ‘‘to demonstrate to 
the subjects exactly what it means to 
see something, although no object is 
actually present” (p. 4). (b) The Ss 
were then asked to look at a picture 
for a specified period of time—e.g., 
15 seconds—and then to look at a 
nearby gray ‘‘projection”’ screen and 
to report what they ‘‘saw”’ there. 
(c) If the Ss stated that they ‘‘saw”’ 
the original picture on the “‘projec- 
tion’ screen and gave other behav- 
ioral indications that they actually 
‘“‘saw’’ something there, they were 
classified as Eidetiker, i.e., as indi- 
viduals possessing ‘‘eidetic imagery.”’ 
This procedure immediately raises a 
serious question. Did the Ss— 
who were almost always elemen- 
tary school children—state that they 
‘“‘saw’’ something on the screen in or- 
der to please the experimenter? Did 
some children report that they “saw” 
the picture there because they were 
perfectly aware, from the nature of 
the instructions, that the experimen- 
ter expected them to say that they 
“saw’’ the picture on the ‘“‘projec- 
tion’ screen? More than one investi- 
gator (Kliiver, 1928; Morsh, 1945; 
Schwab, 1924) has emphasized the 
ambiguous nature of the word “‘see,”’ 
especially as it is used by children. 
Given the ambiguous nature of this 
term, Morsh and Abbott (1945) cor- 
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rectly point out that when “‘Jaensch 
recommends telling the child subject 
that something ‘must be seen’... it 
is expected that most children will 
‘see’ something.’’ In fact, in their 
own investigation, Morsh and Abbott 
(1945) found that when the children 
stated that they ‘‘saw’’ many minute 
details of the picture, they later ad- 
mitted that they “saw” them in their 
mind and not on 
screen. 

If some children actually did ‘‘see’’ 
something on the screen, how do we 
know that what they ‘‘saw’’ was not 
an afterimage? Since, in many in- 
stances, the children were instructed 
only to “‘look”’ at the original picture, 
there was nothing to prevent them 
from fixating a point and subsequent- 
ly experiencing an afterimage. All- 
port (1928), Koffka (1923), and 
other investigators (Schroff, 1926; 
Scola, 1925) report that this often oc- 
curred. Allport (1928), for example, 
writes that what some investigators 
have called an “eidetic image’’ is 
‘without doubt often an after-image 
(since it ‘can be produced through fix- 
ation’).”” E. R. Jaensch (1930) has 
also noted that in some cases ‘‘eidetic 
images”’ are difficult to differentiate 
from afterimages; e.g., “In 
cases in which the imagination has 
little influence, they are merely modi- 
fied after-images.”’ In fact, 


the ‘projection’ 


those 


the 


eidetic images’”’ of one large group 


of Eidetiker—the so-called ‘T-type’ 
(W. Jaensch, 1926)—are difficult, if 
not impossible, to distinguish from 
physiological afterimages; both the 
afterimage and this type of “‘eidetic 
image’ are indistinct, cannot be al- 
tered in form and color at will, usu- 
ally follow Emmert’s law, and usually 
show the complementary color. 

In some instances, however, Ss 
state that they “‘see,’’ and behave as 
if they “see,”’ something on the ‘‘pro- 
jection’”’ screen when the situation 
precludes their ‘“‘seeing’’ an after- 
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image. Investigators report that it is 
not always necessary for the “‘eidet- 
ic’ S to first look at a_ picture 
(Kliiver, 1926). Not only do ‘‘spon- 
taneous [eidetic] images occur’’ (W. 
Jaensch, 1921), but some individuals 
are ‘‘able after hours, days, and even 
months and years to reproduce an 
eidetic image with all its previous 
vividness’”” (Kroh, 1922) and, at 
the ‘‘eidetic image’’ even 
‘takes an obsessive character and 
recurs without volition’ (Allport, 
1924). Purdy (1936) reports that his 
“eidetic’’ S is able, at practically any 
time, to “see’’ a vivid, three-dimen- 
sional ‘‘eidetic image”’ of any person 
or object; in fact, the S’s ‘“images’’ 
are often so vivid that they suppress 
the actual surroundings. In addi- 
tion, this S can ‘see’ an ‘“‘eidetic 
image’ of a man devoid of a head, 
can “see” green leaves upon barren 
winter trees, can ‘‘see’’ a smooth- 
shaven man with a full beard, can 
‘“‘see’”’ objects as altered in size and 
position, etc. There is no essential 
difference between this S’s ‘‘eidetic 
images’? and what has been termed 
since the last century as “‘the waking 
hallucinations of healthy persons’’ 
(Parish, 1897) and as the ‘“nega- 
tive and positive hallucinations” of 
“good” hypnotic Ss (Barber, 1958b; 
Barber, in press). As McDougall 
(1929) and Smythies (1956) have 
pointed out, if an S states that he 
and behaves as if he ‘‘sees,”’ 
a person or an object which has 
many if not all of the characteristics 
of an actual object, when that “ob- 
ject” is by no means present to other 
observers, the S is carrying out “hal- 
lucinatory’’ behavior in the strict 
sense of the term, even though he re- 
mains perfectly aware that the ‘‘ob- 


times, 


‘ 


“ oy 
sees, 


ject’’ is his own creation. To say that 


Purdy’s S ‘‘knows”’ that what she 
‘‘sees’’ does not actually exist, while 
the “‘good’’ hypnotic S ‘‘believes’’ 
that his “hallucinations” are existen- 
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tial realities, may be perfectly cor- 
rect (Barber, 1958a) ; however, this is 
a secondary characteristic of the be- 
havior which we deduce after the be- 
havior has occurred. During the 
event itself—i.e., when Purdy’s S is 
“seeing” the “eidetic image’ and 
when the ‘‘good”’ hypnotic S is ‘‘see- 
ing” the “hallucination’’—the be- 
havior in both cases is essentially the 
same. 

Similarly, when W. Jaensch (1926) 
discusses the “‘eidetic images’’ which 
can be induced by mescaline, there is 
no way that we can differentiate 
these so-called ‘‘eidetic images’’ from 
what other investigators (Beringer, 
1927; Marshall, 1937; Stockings, 
1940) have termed the “visual hal- 
lucinations’”’ which can be induced 
(in some Ss) by this drug. Also, when 
Jaensch reports that the ‘‘eidetic im- 
age” of a color is often followed by 
its appropriate negative afterimage, 
he is reporting the same behavior 
which has been conceptualized, since 
1888, as the “‘color hallucinations” of 
hypnotic Ss (Binet & Féré, 1888) 
and, more recently, as the “halluci- 
nated colors” of ‘‘normal’’ Ss (Bar- 
ber, in press). In fact, it is difficult, 
if not impossible, to differentiate the 
second large group of Eidetiker— 
the so-called ‘‘B-type’’ (W. Jaensch, 
1921)—from individuals who can 
“hallucinate” at will; in both cases, 
the S is not only able to call up an 
“image” of an object or person and 
to banish it whenever he desires but 
he is also able to alter its form, color, 
duration, and location at will. 

Even if we agree that behavior 
should not be termed “‘hallucinatory”’ 
unless the S ‘‘believes’’ that the 
“hallucinatory’”’ object is a real ob- 
ject, we can still insist that some 
cases of “‘eidetic imagery’”’ cannot be 
differentiated from a type of “‘hallu- 
cinatory”’ behavior; as E. R. Jaensch 
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(1930) writes, “In exceptionally 
strong cases... eidetic images. and 
real objects can under certain condi- 
tions be confused with one another”’ 
(p. 18). Fischer and Welke (1926) 
also note this difficulty and empha- 
size that “hallucinations” should 
be classified into three categories: 
(a) non-psychogenic hallucinations, 
(b) psychogenic hallucinations, and 
(c) “eidetic images’’ with ‘‘reality- 
character.”’ 

If the term “‘eidetic image’’ has 
subsumed more than one type of be- 
havior, the contradictory data and 
conclusions of investigations in this 
area become understandable. Fur- 
thermore, if the term has referred, in 
some instances, to a type of “‘hallu- 
cinatory” behavior, further investi- 
gations of this phenomenon could 
provide a solid basis for a general 
theory of “hallucinations.” How- 
ever, instead of asking Ss to look ata 
picture (and then to report what they 
“‘see’”’ on a “projection” screen)— 
thus allowing the Ss to experience an 
afterimage from the original picture 
—we should directly ask them first to 
“imagine’’ an object and then to at- 
tempt to “‘project’”’ the imagined ob- 
ject at some point in the room. From 
Galton’s investigation (1883) during 
the last century, from the experi- 
ments on “eidetic imagery’ and 
“hallucinatory” behavior (Martin, 
1915) during this century, and from 
the writer’s recent experiments (Bar- 
ber, 1957; Barber, in press) with 
“‘good”’ hypnotic Ss—who in many 
cases were able to carry out this type 
of behavior long before they were ever 
“‘hypnotized’’—we can expect to find 
a ielatively small proportion of 


adults and a larger proportion of chil- 
dren who “‘hallucinate”’ at will—i.e., 
who will, during the experiment, 
(a) report that they have “‘projected”’ 
the imagined object, (0) insist—when 
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critically examined—that it does not 
differ at all, or differs only in some re- 
spects, from an actual object, and 
(c) behave, overtly and physiolog- 
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ically (e.g., alpha blocking on the 
electroencephalogram, alteration in 
pupil size) as if they are actually 
“seeing’’ an existential reality. 
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A NOTE ON “CORRECTING 





PERSONALITY SCALES FOR 


RESPONSE SETS OR SUPPRESSION EFFECTS” 
HAROLD WEBSTER 
University of California, Berkeley 


I am grateful to Samuel Messick, 
of Educational Testing Service, for 
pointing out an error in “Correcting 
Personality Scales for Response Sets 
or Suppression Effects,’ which ap- 
peared in this journa! for January, 


1958. The second part of Formula 
[4] should read 
Se . 5. 
——— rr, instead of ——— rr:. 
Jt Sr 


Messick also notes that Formula 
[7] can be improved by substituting 
rrcersy for the regression coefficient b. 
This improved version of [7] may 
be derived using several approaches, 
but the most general is probably to 
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consider 7’=7T—Obt a special case of 
Mosier’s (1943) formula for the reli- 
ability of a weighted composite. It 
seems necessary in any of the meth- 
ods, including the application of 
Mosier’s formula, to regard 6 as a 
constant, thereby ignoring its sam- 
pling variance; in any case the sam- 
pling fluctuations of b are likely to be 
small in comparison with the other 
variations which are allowed for in 
the improved formula. 
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